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Description 

FIELD OF THE INVENTION 

The present invention relates to a keto group-introducing enzyme necessary for synthesizing ketocarotenoids, 
such as astaxanthin, which are useful for a red-color enhancing treatment of cultured fishes and shellfishes (such as 
sea bream, salmon and shrimp) and are also applied to foods as a coloring agent or an antioxidant; a DNA coding for 
the above enzyme; a recombinant vector comprising the DNA; a microorganism into which the DNA has been intro- 
duced; and a method for producing ketocarotenoids using the above microorganism. 

BACKGROUND ART 

"Ketocarotenoid" is a general term for keto group-containing carotenoid pigments. Carotenoids are synthesized 
from mevalonic acid as a starting substance via isoprenoid basic biosynthesis pathway which shares an initial part with 
the synthesis pathway for steroids and other isoprenoids (see Fig. 6). Isopentenyl pyrophosphate (IPP) with 5 carbon 
atoms, which is a basic unit, generated from the isoprenoid basic biosynthesis pathway condenses with its isomer 
dimethyiallyl pyrophosphate (DMAPP) to produce geranyl pyrophosphate (GPP) with 10 carbon atoms and, in addition, 
IPP condenses to produce farnesyl pyrophosphate (FPP) with 15 carbon atoms. FPP produces geranylgeranyl pyro- 
phosphate (GGPP) with 20 carbon atoms by condensing with IPP again. Then, GGPPs condense with each other to 
produce colorless phytoene which is the initial carotenoid. Through a series of unsaturated reactions, phytoene is con- 
verted to phytofluene, C-carotene, neurosporene and finally to lycopene. Subsequently, lycopene is converted by a 
cyclization reaction to a p-carotene containing two p-ionone rings. Finally, it is believed that a keto-groups, a hydroxyl 
group, etc. are introduced into the p-carotene to thereby synthesize astaxanthin, zeaxanthin and the like (see Britton, 
G.. "Biosynthesis of Carotenoids". Plant Pigments. Goodwin, T.W (ed.). London. Academic Press, 1988. pp. 133-182). 

Recently, the present inventors have cloned a group of carotenoid biosynthesis genes of the non-photosynthetic 
bacterium Erwina uredovora present in plant from the genomic DNA library in E. coli using its yellow color formation as 
an indicator. Further, by expressing a various combinations of these genes in microorganisms such as E. coft the inven- 
tors has made it possible to produce in microorganisms such as E. co// phytoene, lycopene, p-carotene and zeaxanthin 
which is a yellow carotenoid pigment wherein a hydroxyl group has been introduced into p-carotene (see Fig. 7) (Mis- 
awa, N., Nakagawa, M., Kobayashi, K., Yamano, S., Izawa, Y, Nakamura; K., and Harashima. K., "Elucidation of the 
Erwinia uredovota Carotenoid Biosyrrthetic Pathway by Functional Analysis of Gene Products Expressed in Escherichia 
co/f, J. Bacteriol., 172, pp. 6704-6712, 1990; Misawa, N.. Yamano, S, Ikenaga, H. f "Production of p^arotene in 
Zymomonas mobilis and Agrobacterium tumefaciens by Introduction of the Biosynthesis Genes from Erwinia ure- 
dovora\ Appl. Environ. Microbiol., 57, pp. 1847-1849, 1991; and Japanese Unexamined Patent Publication No. 3- 

58786). , . 

On the other hand, astaxanthin which is a red ketokarotenoid is a representative animal carotenoid widely present 
in marine organisms, e.g. red fishes such as sea bream and salmon, and crustaceans such as crab and shrimp. Since 
animals generally cannot biosynthesize carotenoids, they have to take in from outside those catotenoids synthesized 
by microorganisms or plants. For this reason, astaxanthin has been widely used for the purpose of red color enhancing 
for cultured fishes and shellfishes such as sea bream, salmon and shrimp. 

Astaxanthin is also used as a coloring agent for foods. Furthermore, astaxanthin is attracting attention as an anti- 
oxidant to remove activated oxygen generated in a body which is causative of a cancer (see Takao Matuno and Wataru 
Inui, "Physiological Functions and Biological Activities of Carotenoids in Animals", KAGAKU TO SEIBUTU (Chemistry 
and Organisms), 28, pp. 219-227, 1990). 

As sources of astaxanthin supply, there are known crustaceans such as antarctic krill, a culture of the yeast Phama, 
a culture of the green alga Haematococcus and compounds which are obtained by organic synthesis. However, when 
crustaceans such as antarctic krill are used, it is difficult to separate astaxanthin from various contaminants, such as 
lipids, in a recovery and extraction process, which requires a great labor and cost. When a culture of the yeast Phaffia 
is used, the recovery and extraction of astaxanthin also requires a great cost since its cell wall is rigid and yet the pro- 
duction level of astaxanthin is low. In the case of using a culture of the green alga Haematococcus, it is necessary to 
supply to the alga during its cultivation some light which is essential for astaxanthin synthesis. Therefore, appropriate 
conditions on a location for taking sun light in or cultivation facilities capable of supplying artificial light are required. In 
addition, it is difficult to separate the produced astaxanthin from mixed up chlorophyl and by-products (fatty acid esters). 
For these reasons, it has been true that the organism-derived astaxanthin described above cannot compete with those 
obtained by organic synthesis in cost. However, considering that astaxanthin is used as feed for fishes and shellfishes 
and as a food additive, an astaxanthin prepared by organic synthesis has some problems with respect to by-products 
produced in the reaction and yet such an astaxanthin is against the consumers' liking for natural products. 

Under circumstances, the development of a method for producing an organism-derived cheap astaxanthin which is 
safe and can meet the consumers' liking for natural products is desired. 
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Then, it is believed that the acquisition of a group of genes involved in the biosynthesis of astaxanthin would be very 
useful, because it is possible to render an optimal microorganism with respect of safety as a food and a potential ability 
to produce astaxanthin, regardless of whether it has an ability to produce astaxanthin or not, the production ability by 
introducing into the microorganism the group of astaxanthin synthesis genes and expressing them. In this case, there 

5 will occur no problem of the mixing of by-products. In addition, by using techniques of the highly advanced genetic engi- 
neering, it will not be difficult to increase the amount of astaxanthin production to a level which exceeds the production 
amount by organic synthesis. As described above, a group of genes to synthesize up to zeaxanthin have already been 
obtained by the present inventors from the non-photosynthetic bacterium Erwinia uredovora. However, no one has suc- 
ceeded in obtaining the gene coding for a keto group-introducing enzyme that is necessary for synthesizing astaxan- 

io thin, though a number of attempts have been made in many research institute because of the industrial utility of 
astaxanthin as described above. As to the reasons, it is considered that enzymes located downstream and involved in 
carotenoid biosynthesis, such as a keto group-introducing enzyme, are membrane proteins and that the purification and 
measurement of activity of those enzymes have been impossible; therefore, there has been no finding about those 
enzymes. In particular, as to a keto group-introducing enzyme, not only findings about the enzyme itself but also f ind- 

15 ings about the gene coding for the enzyme have not been reported at all. Therefore, to date, it has been impossible to 
produce astaxanthin in a microorganism or the like by using genetic engineering techniques. 

DISCLOSURE OF THE INVENTION 

20 Accordingly, it is an object of the invention to provide the gene coding for a keto group-introducing enzyme which 
is necessary for producing ketocarotenoids containing keto groups, such as astaxanthin. 
It is another object of the invention to provide a keto group-introducing enzyme. 

It is still another object of the invention to provide a recombinant vector comprising the gene coding for the keto 
group-introducing enzyme. 

25 Further, it is still another object of the invention to provide a microorganism into which the gene coding for the keto 
group-introducing enzyme have been introduced. 

Further, it is still another object of the invention to provide a method for producing ketocarotenoids by using the 
above microorganism into which the gene coding for the keto group-introducing enzyme have been introduced. 

The present inventors have made extensive and intensive researces toward solution of the above assignment and, 

30 as a result, have succeeded in cloning from the cDNA of the green alga Haematococcus the gene coding for a keto 
group-introducing enzyme, preparing a vector DNA incorporating the gene, introducing the vector DNA into E. coli, cul- 
turing the resultant E. coli in a medium, then collecting the cells from the medium, and extracting ketocarotenoids such 
as echinenone, canthaxanthin, astaxanthin, 4-ketozeaxanthin and the like. The present invention has been thus 
achieved. In other words, the invention provides a polypeptide having an enzyme activity to convert the methylene 

35 group at position 4 of a p-ionone ring to a keto group. The invention also provides a DNA comprising a base sequence 
coding for a polypeptide having an enzyme activity to convert the methylene group at position 4 of a p-ionone ring to a 
keto group. Further, the invention provides a recombinant vector comprising the above DNA. The invention also pro- 
vides a microorganism into which the above DNA has been introduced. In addition, the invention provides a method for 
producing ketocarotenoids, comprising culturing in a medium the microorganism into which the DNA has been intro- 

40 duced and extracting ketocarotenoids from the culture cells. 

Hereinbelow, the present invention will be described in more detail. 

1 . Keto group-introducing enzyme 

45 The keto group-introducing enzyme of the invention is a polypeptide having an enzyme activity to convert the meth- 
ylene group at position 4 of a p-ionone ring to a keto group. This polypeptide may be a polypeptide comprising the 
amino acid sequence substantially as shown in SEQ ID NO: 1 of the sequence listing (the amino acid sequence from 
A to D shown in Fig. 1), the amino acid sequence substantially as shown in SEQ ID NO: 2 (the amino acid sequence 
from B to D shown in Fig. 2) or the amino acid sequence substantially as shown in SEQ ID NO: 3 (the amino acid 

so sequence from C to D shown in Fig. 3). The expression "the amino acid sequence substantially as shown in SEQ ID 
NO: 1, SEQ ID NO: 2 or SEQ ID NO: 3 of the sequence listing" used here means an amino acid sequence which may 
have variations such as deletion, substitution, addition, etc. in some of the amino acid residues in the sequence as 
shown in SEQ ID NO: 1. SEQ ID NO: 2 or SEQ ID NO: 3 of the sequence listing as long as such an amino acid 
sequence has the enzyme activity to convert the methylene group at position 4 of a p-ionone ring to a keto group, as 

55 well as the amino acid sequence as shown in SEQ ID NO: 1, SEQ ID NO: 2 or SEQ ID NO: 3. For example, an amino 
acid sequence wherein the first amino acid residue (Met) in SEQ ID NO: 1, SEQ ID NO: 2 or SEQ ID NO: 3 of the 
sequence listing is deleted is included in the above expression. 

In one embodiment, the keto group-introducing enzyme of the invention is able to synthesize canthaxanthin via 
echinenone using p-carotene as a substrate. Also, the enzyme of the invention can convert the methylene group at 
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position 4 of 3-hydroxy-p-ionone ring to a keto group. As one specific example of the above, the enzyme of the invention 
can synthesize astaxarrthin via 4-ketozeaxanthin using zeaxanthin as a substrate (see Fig. 8). Since p-carotene and 
zeaxanthin, which are carotenoids. contain two molecules of p-ionone rings in one molecule, first the methylene group 
at position 4 of one p-ionone ring is converted to a keto group to produce echinenone and 4-ketozeaxanthin, respec- 
tively, and then the methylene group at position 4' (equivalent to position 4) of the other p-ionone ring is converted to a 
keto group to produce canthaxanthin and astaxarrthin, respectively. 

2. Keto group-introducing enzyme gene (bkt) 

The gene coding for the keto group-introducing enzyme of the invention (hereinafter referred to as "bkT) is a DNA 
comprising a base sequence coding for a polypeptide having an enzyme activity to convert the methylene group at posi- 
tion 4 of a p-ionone ring to a keto group. A typical example of this gene is a bkt gene which can be cloned from the green 
alga Haematococcus pluvialis (NIES-144). This is a DNA comprising a base sequence coding for a polypeptide com- 
prising the amino acid sequence which is substantially shown from A to D in Fig. 1 (the amino acid sequence as shown 
in SEQ ID NO: 1 of the sequence listing), the amino acid sequence which is substantially shown from B to D in Fig. 2 
(the amino acid sequence as shown in SEQ ID NO: 2 of the sequence listing), or the amino acid sequence which is sub- 
stantially shown from C to D in Fig. 3 (the amino acid sequence as shown in SEQ ID NO: 3 of the sequence listing). 
Examples for the base sequences coding for the amino acid sequences as shown in SEQ ID NO: 1 , SEQ ID NO: 2 and 
SEQ ID NO: 3 of the sequence listing are given in SEQ ID NOS: 4 and 5, 6 and 7, respectively. The base sequence 
shown in SEQ ID NO: 4 includes a non-coding region in the upstream of the base sequence shown in SEQ ID NO: 5 
which is a coding region. Needless to say, the bkt gene of the invention includes not only the DNAs comprising the base 
sequences shown in SEQ ID NOS: 4, 5, 6 and 7, but also those DNAs comprising a degenerate isomer coding for the 
same polypeptide which is different only in degenerate codons. 

The bkt gene product (hereinafter referred to as "BKT*), i.e., the keto group-introducing enzyme of the present 
invention has, as described above, an enzyme activity to convert the methylene group at position 4 of a p-ionone ring 
in a compound containing p-ionone rings to a keto group. In one embodiment, BKT can synthesize canthaxanthin via 
echinenone using p-carotene as a substrate (see Fig. 8). Further, BKT can also convert the methylene group at position 
4 of 3-hydroxy-p-ionone ring to a keto group. For example, BKT can synthesize astaxanthin via 4-ketozeaxanthin using 
zeaxanthin as a substrate (see Fig. 8). A polypeptide having such an enzyme activity and the DNA coding for it have 
not been known. This polypeptide and the DNA coding therefor do not have an overall homology with any of the 
polypeptides and DNAs which have been known to date. In addition, not limited to the conversion in a p-ionone ring or 
3-hydroxy-p-ionone ring, there has been no finding that one enzyme converts a methylene group immediately to a keto 
group. 

On the other hand, by using the carotenoid synthesis gene group of crtE, crtB, crtl and crtY from the non-photo- 
synthetic bacteria Erwinia. it is possible to render a microorganism such as E. colian ability to produce p-carotene. By 
using crtZ gene in addition to the above four genes, it is possible to render a microorganism such as E. colian ability to 
produce zeaxanthin (see Fig. 7 and WO91/13078, supra). 

Accordingly, since p-carotene and zeaxanthin (which are substrates for BKT) are supplied by these crt gene group 
from Eiwinia, when the DNA of the invention (bkt gene) is further introduced to a microorganism such as E. coli carrying 
the crt gene group from Erwinia, it will become possible for a p-carotene producing microorganism to produce canthax- 
anthin via echinenone and for a zeaxanthin producing microorganism to produce astaxanthin via 4-ketozeaxanthin (see 
Fig. 8). However, in a zeaxanthin producing microorganism, p-cryptoxanthin is contained in an extremely small amount 
as an intermediate. Therefore, in addition to the major metabolic pathway described above, there may be another path- 
way producing astaxanthin from p-cryptoxanthin via 3-hydroxyechinenone and 4-ketozeaxanthin, and still another path- 
way producing phoenicoxanthin from p-cryptoxanthin via 3-hydroxyechinenone or 3'-hydroxyechinenone. As products 
of these minor pathways, it is considered that S'-hydroxyechinenone, 3-hydroxyechinenone and phoenicoxanthin can be 
produced (see Fig. 9). 

3. Acquisition of the DNA 

One means to obtain the DNA comprising a base sequence coding for an amino acid sequence of the keto group- 
introducing enzyme (BKT) of the invention is to chemically synthesize at least a portion of the DNA chain according to 
the conventional nucleic acid synthesizing methods. However, considering the length of sequence, it is preferable not 
to use the chemical synthesis but to obtain mRNA from the green algae Haematococcus (Haematococcus pluvialis and 
Haematococcus lacustris are representative varieties), prepare a cDNA library therefrom using E. coli t and obtain the 
DNA from this library by conventional methods used in the field of genetic engineering, e.g., the hybridization method 
with appropriate probes or the expression cloning method which the inventors have employed. 

Specifically, the total RNA of Haematococcus pluvialis is separated and poly A + RNA is purified using Oligotex- 
dT30 Super (Takara Shuzo). Using this poly A + RNA as a template, cDNA is synthesized with the reverse transcriptase 
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Superscript RT (Gibco BRL) and then double-stranded cDNA is synthesized with E. coli DNA ligase, E. coli DNA 
polymerase and E. coli DNA RNase H (all manufactured by (Gibco BRL). The synthesized cDNA is incorporated in an 
E. coli expression vector pSPORTI (Gibco BRL) and a cDNA library is prepared. Using this cDNA library, a p-carotene 
producing E. co//( E. coli carrying the crt gene group of Erwinia as described above) is transformed. From the changes 

5 in color tone in the resultant transformants, those microorganisms carrying the keto group-introducing enzyme gene are 
screened. This method utilizes the phenomenon that the color tone of E. coli changes from a p-carotene-derived yellow 
to a canthaxanthin-derived red when a keto group has been introduced and carrthaxanthin, one of ketocarotenoids, has 
been synthesized. From the transformed red E coli thus obtained, a plasmid having a cDNA of interest is isolated and 
the cDNA is re-linked to E. coli vectors pBluescript II SK+ and pBluescript II KS+ (Stratagene). With these plasmids, 

w deletion variants having various lengths of deletions are produced and the base sequences of the variants are deter- 
mined. 

4. DNAs which hybridize with the bkt gene 

is To date, several varieties of the green algae Haematococcus have been isolated and identified, and all of them are 
considered to synthesize ketocarotenoids such as astaxanthin. In yeast Phaffia rhodozy ma which is also an eucaryote 
has been reported to synthesize ketocarotenoids such as astaxanthin (Johnson, E.A. and An, G.-Hwan, "Astaxanthin 
from Microbial Sources", Critical Reviews in Biotechnology, 11 , pp. 297-326, 1991). It is possible to obtain other genes 
of keto group-introducing enzymes from the above astaxanthin producing algae or microorganisms by using as a probe 

20 the Haematococcus pluvialis NIES-144 bkt gene as described above and carrying out hybridization by utilizing their 
homology. The present inventors have selected from those Haematococcus capable of synthesizing astaxanthin two 
varieties which are different from Haematococcus pluvialis NIES-144 in assimilation property and phenotype against 
light. They are Haematococcus lacustris UTEX 294 (released from the Culture Collection of Algae at the University of 
Texas at Austin) and Haematococcus lacustris C-392 [released from the Microorganisms and Microalgae Center 

25 belonging to the Applied Microorganism Laboratory (the current Molecular Cell Biology Laboratory), the University of 
Tokyo]. The genomic DNAs from these varieties were prepared and Southern hybridization was conducted using as a 
probe the Haematococcus pluvialis NIES-144 bkt gene. The results were as expected by the inventors. The bkt probe 
strongly hybridized with specific DNA fragments derived from either of the genomic DNA. Therefore, the present inven- 
tion includes those DNAs which hybridize with the above-described DNAs (SEQ ID NOS: 4, 5, 6 and 7). 

30 

5. Transformation of a microorganism such as E. coli 

By introducing the DNA of the invention as a foreign gene into an appropriate microorganism such as bacteria (e.g., 
E. coh\ Zymomonas mobilis t Agrobacterium tumefaciens), yeast (e.g. Saccharomyces cerevisiae), etc and expressing 
35 it, various ketocarotenoids can be produced. 

Hereinbelow, the method for introducing a foreign gene into a preferable microorganism will be described briefly. 
With respect to procedures or methods for introducing a foreign gene into a microorganism such as E coli and express- 
ing the gene, conventional ones used in the field of genetic engineering may be used, as well as the procedures 
described herein. For example, procedures or methods according to those described in "Vectors for Cloning Genes", 
40 Methods in Enzymology, 216, pp. 469-631, 1992, Academic Press and " Other Bacterial Systems", Methods in Enzy- 
mology, 204, pp. 305-636, 1991 , Academic Press may be used. 

{Introduction of the gene into E. co//> 

45 As a method for introducing a foreign gene into E. coll there are several established, effective methods which may 
be used, such as Hanahan's method and the rubidium method (see, for example, Chapter 1 , pp. 74-84, Sambrook, J., 
Frrtsch, E.F., Maniatis, T, "Molecular Cloning, A Laboratory Manual", Cold Springs Harbor Laboratory Press, 1989). For 
the expression of a foreign gene in E. coli, it is preferable, for example, to introduce into E. coli a lac promoter-containing 
E. coli expression vector into which the foreign gene has been inserted according to conventional methods (see, for 

so example, Chapter 17, pp. 3-41 , "Molecular Cloning, A Laboratory Manual" supra). The present inventors have inserted 
the Haematococcus bkt gene into the E. co//cDNA expression vector pSPORTI (Gibco BRL) having a lac promoter etc. 
in a direction so that the inserted gene undergoes a read through of the transcription of the lac promoter, and then intro- 
duced the resultant vector into E coli. 

55 (Introduction of the gene into yeast) 

As a method for introducing a foreign gene into the yeast Sacchromyces cerevisiae, there are established methods 
such as the lithium method which may be used (for example, see "KOHBONO NYUHBAIOTEKUNOROJIH (New Bio- 
technology of Yeast)" edited by the Bioindustry Association under the supervision of Y. AWyama, published by Igaku 
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Shuppan Center). For the expression of a foreign gene in yeast, it is preferable to construct an expression cassette 
using a promoter and a terminator such as PGK and GPD. in which cassette the foreign gene is inserted between the 
promoter and the terminator so that the gene undergoes a read through of the transcription. Then, this expression cas- 
sette is inserted into a vector for S. cerevosiae , for example. YRp system vector (a yeast multicopy vector making the 
ARS sequence in yeast chromosomes as a replication origin). YEp system vector (a yeast multicopy vector having a 
replication origin of yeast 2 jwn DN A). Yip system vector (a vector to be incorporated in yeast chromosomes, not having 
a replication origin of yeast), etc. and the resultant vector is introduced into the yeast (see "New Biotechnology of 
Yeast" supra ; Japan Agricultural & Horticultural Chemistry Association ABC Series "BUSSHITU SEISAN NOTAMENO 
IDENSHIKOUGAKU (genetic Engineering for the Production of Substances)". Asakura Shoten Co.. Ltd.; and \femano. 
S., Ishii, T. Nakagawa. M., Ikenaga, H., and Misawa. N.. "Metabolic Engineering for Production of p-carotene and Lyc- 
opene in Sacchromyces cerevisiae\ Biosci. Biotech. Biochem.. 58. pp. 1112-1114, 1994). 

(Introduction of the gene into Zymomonas mobilise 

-me introduction of a foreign gene into the ethanol producing bacterium Zymomonas mobifis can be achieved by 
the conjugative transfer method which is commonly used for Gram-negative bacteria. For the expression of a foreign 
gene in Zymomonas mobilis, it is preferable, for example, to introduce into Zymomonas mobilis an expression vector 
into which the foreign gene has been inserted (e.g. . vector pZA22 for Zymomonas mobilis) (see Nakamura. K. f "Molec- 
ular Breeding of Zymomonas Bacteria", Journal of Japan Agricultural & Horticultural Chemistry Association, 63, pp. 
1016-1018, 1989; and Misawa. N.. Yamano. S. Ikenaga. H.. "Production of p-carotene in Zymomonas mobilisand Agro- 
bacterium tumefaciens by Introduction of the Biosynthesis Genes from Erwinia uredovomT, AppL Environ. Microbiol., 
57, pp. 1847-1849, 1991). 

(Introduction of the gene into Agrobacterium tumefaciens) 

The introduction of a foreign gene into the plant pathogenic bacterium Agrobacterium tumefaciens can be achieved 
by the conjugative transfer method which is commonly used for Gram-negative bacteria. For the expression of a foreign 
gene in Agrobacterium tumefaciens, it is preferable, for example, to introduce into Agrobacterium tumefaciens an 
expression vector into which the foreign gene has been inserted (e.g.. vector pBI121 for Agrobacterium tumefaciens) 
(see Misawa. N.. Yamano. S. Ikenaga, H.. "Production of p-carotene in Zymomonas mobilis and Agrobacterium tume- 
faciens by Introduction of the Biosynthesis Genes from Erwinia uredovoreT. Appl. Environ. Microbiol.. 57. pp. 1847- 
1849. 1991). 

6. Production of ketocarotenoids by microorganisms (expression of the bkt gene) 

By using the techniques or methods as described above to introduce a foreign gene into a microorganism, it is pos- 
sible to introduce into a microorganism a Haematococct/s<lerived group of ketocarotenoid (including astaxanthm) syn- 
thesis genes and express them. 

Farnesyl pyrophosphate (FPP) is not only a substrate of carotenoids but is also a common substrate of other iso- 
prenoids such as sesquiterpene, triterpene, sterol, hopanol, etc. Generally, microorganisms including those which can- 
not synthesize carotenoids synthesize other isoprenoids. THerefore, basically every microorganism is supposed to have 
FPP as an intermediary metabolite. On the other hand, using FPP as a substrate, the carotenoid synthesis gene group 
of the non-photosynthetic Erwinia is able to synthesize the substrates of the Haematococcus bkt gene product, i.e., up 
to p-carotene and zeaxanthin (see Fig. 7). The present inventors have introduced the Erwinia crt gene group not only 
into E. co// but also the microorganisms described above, (i.e. the yeast Saccharomyces cerevisiae t the ethanol produc- 
ing bacterium Zymomonas mobilis and the plant pathogenic bacterium Agrobacterium tumefaciens) and confirmed that 
these microorganism have become able to produce carotenoids such as p-carotene as expected (see Yamano, S.. Ishii, 
T Nakagawa, M., Ikenaga, H. f and Misawa. N., " Metabolic engineering for production of p-carotene and lycopene in 
Saccharomyces cerevisiaet Biosci., Biotech. Biochem., 58, p. 1112-1114. 1994; Misawa, N., Yamano, S, Ikenaga, H., 
"Production of p-carotene in Zymomonas mobilis and Agrobacterium tumefaciens by introduction of the biosynthesis 
genes from Erwinia uredovora\ Appl. Environ. Microbiol.. 57. pp. 1847-1849, 1991 ; and Japanese Unexamined Patent 
Publication No. 3-58786). ^ kia . 

Accordingly, by introducing a combination of Envwa-derived carotenoid synthesis genes with the DNA of the inven- 
tion (which is typically the Haemafococcus-derived carotenoid synthesis gene bkt) into the same microorganism simul- 
taneously it becomes possible to produce ketocarotenoids such as astaxanthin in all of those microorganisms wherein 
a gene introduction/expression system has been established. Alternatively, by introducing the DNA of the invention into 
a microorganism which inherently has carotenoid synthesis genes or a microorganism into which carotenoid synthesis 
genes have been already introduced, it is also possible to produce ketocarotenoids in the above microorganism. Here- 
inbelow. the production of various ketocarotenoids by microorganisms will be described. 
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(Production of canthaxanthin and echinenone) 

By introducing into a microorganism, such as E. coii, the Erwinia uredovora crtE, crtB, crtl and crtY genes neces- 
sary for the synthesis of p-carotene and the Haematococcus bkt gene which is a keto group-introducing enzyme gene 

s and expressing them, it is possible to allow the microorganism to produce canthaxanthin as a final product. Further- 
more, by regulating the level of expression of the bkt gene or the like, echinenone which is a synthetic intermediate can 
also be obtained. For example, in order to produce canthaxanthin and echinenone in E. coii, both a first plasmid (e.g., 
pACCARI 6AcrtX) obtainable by inserting into an E coii vector (e.g., pACYCI 84) a fragment containing the Erwinia ure- 
dovora crtE, crtB, crtl and crtY genes and a second plasmid [e.g., pHP51 (see Fig. 10)] obtainable by inserting into an 

10 E. coii vector (e.g., pBluescript II KS+) a fragment containing the Haematococcus bkt gene are introduced into E. coii 
(e.g., JM101). The resultant E. coli'is cultured in LB medium, 2YT medium or the like containing ampicillin and chloram- 
phenicol under culture conditions at 30-37 °C until the stationary phase. Then, cells are harvested and carotenoid pig- 
ments are extracted by using an organic solvent such as acetone. Canthaxanthin and echinenone may be contained in 
the carotenoid pigments thus obtained. 

15 

(Production of astaxanthin and 4-ketozeaxanthin) 

By introducing into a microorganism, such as E. coii, the Erwinia uredovora crtE, crtB, crtl, crtY and crtZ genes 
necessary for the synthesis of zeaxanthin and the Haematococcus bkt gene which is a keto group-introducing enzyme 

20 gene and expressing them, it is possible to allow the microorganism to produce astaxanthin as a final product. Further- 
more, by regulating the level of expression of the bkt gene or the like, 4-ketozeaxanthin which is a synthetic intermediate 
can also be obtained. For example, in order to produce astaxanthin and 4-ketozeaxanthin in E. coii, both a first plasmid 
(e.g., pACCAR25AcrtX) obtainable by inserting into an E. coii vector (e.g., pACYC184) a fragment containing the 
Erwinia uredovora crtE, crtB, crtl, crtY and crtZ genes and a second plasmid (e.g., pHP51) obtainable by inserting into 

25 an E. coii vector (e.g.. pBluescript II KS+) a fragment containing the Haematococcus bkt gene are introduced into E. 
coii (e.g., JM101). The resultant E. co//ts cultured in, for example, LB medium or 2YT medium containing ampicillin and 
chloramphenicol under culture conditions at 30-37 °C until the stationary phase. Then, cells are harvested and carote- 
noid pigments are extracted by using an organic solvent such as acetone. Astaxanthin and 4-ketozeaxanthin may be 
contained in the carotenoid pigments thus obtained. 

30 

(Production of 3'-hydroxyechinenone, 3-hydroxyechinenone and phoenicoxanthin) 

By introducing into a microorganism, such as E. coii, the Erwinia uredovora crtE, crtB, crtl, crtY and crtZ genes 
necessary for the synthesis of zeaxanthin and the Haematococcus bkt gene which is a keto group-introducing enzyme 
35 gene and expressing them, it is possible to allow the microorganism to produce astaxanthin and 4-ketozeaxanthin as 
major products. However, as minor intermediary metabolites, 3'-hydroxyechinenone, 3-hydroxyechinenone and phoen- 
icoxanthin should be present in the pathway. 

Methods for producing these pigments are similar to those methods described above. For details, see the Exam- 
ples. 

40 

7. Deposit of the microorganism 

The E. coii DH5a into which plasmid pHP51 incorporating the isolated bkt gene (i.e., the DNA of the invention) has 
been introduced was deposited at the National institute of Bioscience and Human-technology, Agency of Industrial Sci- 
45 ence and Technology, as follows: 

Designation for identification assigned by the depositor: DH5cc (pHP51) 
Accession Number: FERM BP-4757 
Date of Deposit: July 26, 1994 

so BRIEF DESCRIPTION OF DRAWINGS 

Fig. 1 shows the base sequence of a keto group-introducing enzyme gene (bkt) from the green alga Haematococ- 
cus pluvialis NIES-144 and the amino acid sequence of a polypeptide encoded by the above base sequence. 

Fig. 2 shows the base sequence of a keto group-introducing enzyme gene (bkt) from the green alga Haematococ- 
55 cus pfuviaiis NIES-144 and the amino acid sequence of a polypeptide encoded by the the base sequence. 

Fig. 3 shows the base sequence of a keto group-introducing enzyme gene (bkt) from the green alga Haematococ- 
cus pluvialis NIES-144 and the amino acid sequence of a polypeptide encoded by the the base sequence. 

In Figs. 1 to 3 above, the initiation codons are different ones. 
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Fig. 4 shows the base sequence of a DNA chain comprising a keto group-introducing enzyme gene (bkt) from the 
green alga Haematococcus pluvialis NIES-144). A, B and C in the Fig. Show the positions of the initiation codons. 
Fig. 5 shows a sequence which follows the one shown in Fig. 4. 
Fig. 6 shows a carotenoid biosynthesis pathway up to p-carotene. 

Fig. 7 shows the carotenoid biosynthesis pathway of the non-photosynthetic Erwinia uredovora as well as the func- 
tions of carotenoid synthesis genes. 

Fig. 8 shows the functions of the keto group-introducing enzyme gene (bkt) from the green alga Haematococcus 
pluvialis NIES-144, the functions of the hydroxyl group-introducing enzyme gene (crtZ) from the non-photosynthetic 
Erwinia uredovora and major ketocarotenoid biosynthesis pathways. 

Fig. 9 shows the functions of the keto group-introducing enzyme gene (bkt) from the green alga Haematococcus 
pluvialis NIES-144, the functions of the hydroxyl group-introducing enzyme gene (crtZ) from the non-photosynthetic 
Erwinia uredovora and minor ketocarotenoid biosynthesis pathways. 

Fig. 10 shows two plasmids pHP5 and pHP51 each containing the keto group-introducing enzyme gene (bkt) from 
the green alga Haematococcus pluvialis NIES-1 44. 

pHP5 is inserted into pSPORT I and pHP51 into pBluescript II KS+ in such a direction that they undergo the read- 
through of the lac promoter. The sites digested by restriction enzymes are abbreviated as follows: S, Sail; Ss, Sstl; P, 
Pstl; Sp, Sphl; N, Notl; X, Xbal; K. Kpnl; Sa, Sacl. 

Fig 1 1 shows the base sequence for a region including the initiation codons of the keto group-introducing enzyme 
gene (bkt) from the green alga Haematococcus pluvialis NIES-144 and indicates the initiation sites of various deletion 

plasmids. , 
Fig. 12 shows the results of Southern analysis (electrophoresis photo) using as a probe a 1 .7 kb DNA fragment of 

the green alga Haematococcus pluvialis NIES-144 bkt gene. 

Lanes 1 -3: Haematococcus pluvialis N I ES-1 44 

Lanes 4-6: Haematococcus lacustris UTEX294 

Lanes 7-9: Haematococcus lacustris C392 

Lanes 1, 4 and 7: Hindlll digest 

Lanes 2, 5 and 8: Pstl digest 

Lanes 3, 6 and 9: Xbal digest 

BEST MODES FOR CARRYING OUT THE INVENTION 

The present invention will be described more specifically below with reference to the following Examples, which 
should not be construed as limiting the scope of the invention. 

[Example 1] Biomaterials and the Medium Composition 

The Haematococcus pluvialis used for obtaining genes is the NIES-144 strain registered at the Foundation Global 
Environmental Forum. H pluvialis was cultured for 4 days in basal medium (yeast extract 0.2%; sodium acetate 0.12%; 
L-asparagine 0.04%; magnesium chloride • 6H 2 0 0.02%; iron(ll) sulfate • 7H 2 0 0.001%; calcium chloride • 2H 2 0 
0 002%) at 20°C under 12 hr light/12 hr dark cycles (20 jiE/m 2 • s). Further, for the induction of astaxanthin synthesis 
in H pluvialis, acetic acid was added to the H. pluvialis NIES-144 strain to a final concentration of 45 mM and iron(ll) 
sulfate. 7H 2 0 to a final concentration of 450 *im, and the strain was cultured at 20°C at a photointensity of 125 jiE/m* 
• s for about 12 hours to thereby induce the formation of cysts. 

[Example 2] Preparation of the Total DNA from Haematococcus pluvialis 

The H. pluvialis NIES-144 strain was seeded on 400 ml of basal medium and cultured at 20 °C at a photointensity 
of 20 jiE/m 2 • s under 1 2 hr light/1 2 hr dark cycles for about 4 days. Then, cells were harvested from the culture, frozen 
with liquefied nitrogen and crushed in a mortar until the cells became powder-like. To the powder-like cells, 15 ml of 
extraction buffer (0.1 M Tris-Hcl pH 8.0. 0.1 M EDTA, 0.25 M NaCI. 0.1 mg/ml Proteinase K) was added, stirred violently 
and then kept at 55°C for 2 hours. Then, the mixture was centrifuged at 6000xg for 10 minutes at 4'C to remove the 
precipitate. To the supernatant, 0.6 volume of isopropanol was added and cooled at -20°C for 30 minutes. Then, the 
mixture was centrifuged at 7500xg for 15 minutes at 4 °C. The centrifuged material containing DNA was dissolved in 2 
ml of TE buffer (10 mM Tris-HCI pH 8.0. 1 mM EDTA). mixed with the same volume of phenolxhloroform (1 :1) and then 
subjected to centrrfugation to extract the upper layer. Subsequently, 80 jil of 5 M NaCI and 5 mL of ethanol were added 
to the upper layer, cooled at -20°C for 30 minutes and then centrifuged at 12000xg for 15 minutes at 4 C C. The precipitate 
was rinsed with 70% ethanol and then dried. Thereafter, the precipitate was dissolved in 0.5 ml of TE buffer (10 mM 
Tris-HCI pH 8.0. 1 mM EDTA) and 2.5 ^il of 10 mg/ml RNase A was added thereto to make a total DNA solution of Hae- 
matococcus pluvialis. 
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[Example 3] Attempt to Isolate crtZ Homologous Regions from H. pluvialis by PCR 

By comparing amino acid sequences encoded by crtZ genes from Erwinia uredovora and Erwinia herbicola (Mis- 
awa, N., Nakagawa, M., Kobayashi, K., Yamano. S., Izawa, Y, Nakamura, K. and Harashima, K., "Elucidation of the 
5 Erwinia uredovora carotenoid biosynthetic pathway by functional analysis of gene products expressed in Escherichia 
coir, J. Bacterid.. 172, pp. 6704-6712, 1990; Hundle, B.S., Beyer, P., Kleinig, K, Englert, G. and Hearst, J.E., "Carote- 
noids of Erwinia herbicola and an Escherichia cofi HB101 strain carrying the Erwinia herbicola Carotenoid Gene Clus- 
ter", Phytochem. Phytobiol., 54, pp. 89-93. 1991), regions with a high homology were found out. By combining those 
codons which are expected to be used in view of the amino acid sequences of these regions, the following 3 primers 
10 were synthesized to prepare mixed primers. 

No. 1 S'-GGNTGGGGNTGGCAYAARTCNCAYCA-S* 
No. 2 5'-C ANCGYTG RTGN ACN AGNCC RTC RTG-3' 
No. 3 5'-GCRTASATRAANCCRAARCTNACRCA-3' 
[N: A, G, C or T; R: A or G; Y: C or T; S: A, G or J] 
is A mixed primer consisting of No. 1 & No. 2 and another mixed primer consisting of No. 1 & No. 3 were prepared 
and PCR (polymerase chain reaction) was carried out using the total DNA solution of H. pluvialis as templates. The fol- 
lowing materials were mixed so that they have the following final concentrations: about 100 ng total DNA solution of H. 
pluvialis; each 100 p,m mixed primers; IxVent Buffer [10 mM KCI, 20 mM Tris-HCI (pH 8.8), 10 mM (NH 4 ) 2 S0 4 , 2 mM 
MgS0 4 , 0.1% Triton X-100); 250 fiM dNTP; and 2 U Vent DNA polymerase (New England Biolabs, Inc.). The PCR was 
20 conducted 30 cycles with the conditions of at 94°C for 30 seconds. 55 °C for 30 seconds and 72°C for 30 seconds, and 
30 cycles with the conditions of at 94 °C for 30 seconds, 60 °C for 30 seconds and 72°C for 30 seconds. Then, the pres- 
ence of reaction products was confirmed by electrophoresis. However, in any of the cases, a definite, single product has 
not been detected. 

25 [Example 4] Preparation of the Total RNA from Haematococcus pluvialis 

The H. pluvialis NIES-144 strain was seeded on 800 ml of basal medium and cultured at 20 °C at a photointensrty 
of 20 p.E/m 2 • s under 1 2 hr light/1 2 hr dark cycles for about 4 days. Then, acetic acid was added thereto to give a final 
concentration of 45 mM and iron(ll) sulfate • 7H 2 0 to a final concentration of 450 urn. Thereafter, cells were cultured 

30 at 20°C at a photointensrty of 125 jiE/m 2 • s for about 12 hours. Then, cells were harvested from the culture, frozen 
with liquefied nitrogen and crushed in a mortar until the cells became powder-like. To the powder-like cells, 3 ml of ISO- 
GEN-LS (Nippon Gene) was added and left at room temperature for 5 minutes. Further, 0.8 ml of chloroform was added 
thereto. The mixture was violently stirred for 15 seconds and then left at room temperature for 3 minutes. The resultant 
mixture was centrifuged at 12000xg for 15 minutes at 4°C to extract the upper layer. To the upper layer, 2 ml of isopro- 

35 panol was added and left at room temperature for 10 minutes. 

Then, the mixture was centrifuged at 12000xg for 10 minutes at 4°C. Subsequently, the precipitate was rinsed with 
70% ethanol, dried and then dissolved in 1 ml of TE buffer (10 mM Tris-HCI pH 8.0. 1 mM EDTA) to obtain a total RNA 
solution of Haematococcus pluvialis. By the above procedures, 4.1 mg of the total RNA was obtained. 

40 [Example 5] Preparation of the cDNA Expression Library of Haematococcus pluvialis 

Using Oligotex-dT30 Super (Takara Shuzo), poly A+ RNA was purified from approximately 1 mg of the total RNA 
of H. pluvialis according to the manufacture's protocol attached to the product. Approximately 14 p,g of poly A+ mRNA 
was purified by this method. 

45 cDNA was prepared by using Superscript TM Plasmid System (Gibco BRL) according to the attached protocol with 
a partial modification as follows. By using approximately 5 jxg of poly A+ RNA, a complementary DNA strand was syn- 
thesized with a synthetic DNA comprising the recognition sequence of the restriction enzyme Notl and an oligo-dT of 
15-mers as a primer. Subsequently, by using E. coli DNA ligase, E. coli DNA polymerase and E. coli DNA RNase H, a 
double-stranded cDNA was synthesized. To this cDNA, the linker of the restriction enzyme Sail was ligated with T4 DNA 

so ligase so that finally the upstream end of this cDNA would be a Sal I site and the downstream of poly A an Notl site. The 
cDNAs obtained were fractionated by size by electrophoresis and the fractions containing fragments ranging from 0.7 
kb to 3.5 kb were collected. About 28 ng of the cDNAs of these fractions and 35 ng of the cDNA expression vector 
pSPORT I (Gibco BRL) which was digested with Notl and Sail were ligated with the ligation buffer (50 mM Tris-HCI pH 
7.6, 10 mM MgCI 2 , 1 mM ATP, 1 mM DTT, 5% PEG 8000) contained in the above-described kit and T4 DNA ligase. This 

55 cDNA expression vector pSPORT i is a vector having a lac promoter upstream of a Sail site and capable of expressing 
a cDNA in E. coli. Then, using all of the ligated DNA solution, transformation of competent cells of the E. coli DH5ct 
which were prepared according to the method described in Molecular Cloning (2nd edition): Cold Spring Harbor Labo- 
ratory, 1.21-1.41 (1989) was carried out. About 40,000 strains of transformants were obtained. Collecting all of these 
transformants, plasmid DNA was prepared according to the method described in Molecular Cloning (2nd edition): Cold 
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Spring Harbor Laboratory, 1 .21 -1 .41 (1989). As a result. 0.6 mg of plasmid DNA was obtained and this was made the 
cDNA expression library of Haematococcus pluvialis. 

[Example 6] Screening Utilizing the Changes of Color Tone in the Keto Group-Introducing Enzyme Gene Carrying E. 

con 

(1) Preparation of p-carotene producing E. coli 

By subjecting plasmid pCAR16 which contains all of the Erwinia uredovora carotenoid synthesis genes (crtE, crtX, 
crtY crtl and crtB) other than crtZ (see Misawa. N.. Nakagawa. M.. Kbbayashi. K.. Yamano. S., Izawa. Y, Nakamura. K. 
and" Harashima K "Elucidation of the Erwinia uredovora carotenoid biosynthetic pathway by functional analysis of 
gene products expressed in Escherichia coir, J. Bacterid.. 172. pp. 6704-6712. 1990; and Japanese Unexamined Pat- 
ent Publication No. 3-58786) to BstEII digestion. Klenow enzyme treatment and a ligase reaction, the crtX gene was 
deactivated by a frameshift. Then, a 6.0 kb Asp718(Kpnl)-EcoRI fragment was cut out which contains the crtE, crtYcrtl 
and crtB genes necessary for p-carotene production. This fragment was inserted into the EcoRV site of I E coh vector 
pACYC184 (obtained from ATCC 37033) to thereby obtain the plasmid of interest (designated as P ACCARl6AcrtX). 
The E. coli carrying this pACCAR16AcrtX exhibits chloramphenicol resistance and can produce p-carotene. 

(2) Screening for the keto group-introducing enzyme gene 

It is considered that ketocarotenoids are biosynthesized in Haematococcus pluvialis via p-carotene (see Britton, G 
- Biosynthesis of carotenoids". Plant Pigments. Goodwin. W.W. (ed.). London, Academic Press. 1988. pp. 133-182). 
Then/utilizing the phenomenon that E. coli JM101 carrying the plasmid P ACCAR16AcrtX described above produces p- 
carotene (yellow), the cDNA expression library obtained above was introduced to this E. coli. Subsequently, the E. coli 
carrying the keto group-introducing enzyme gene was screened from the color change in the resultant transformants. 
It was expected that, when keto groups were introduced and canthaxanthin (one of ketocarotenoids) began to be pro- 
duced the color of E. coli would change from the yellow of p-carotene to the red of canthaxanthin. 

First using the method described in Molecular Cloning (2nd edition): Cold Spring Harbor Laboratory. 1.21-1.41 
(1989). competent cells of E. coli JM101 carrying pACCAR16AcrtX were prepared. 

Then to 1 ml of these competent cells. 100 ng of the cDNA expression library was introduced, and the screening 
was conducted for about 40.000 transformants to thereby isolate one strain which was reddish and slightly different 
from others in color tone. (The pigment of this strain was identified as canthaxanthin in Example 7.) In addition, the 
cDNA expression plasmid carried by this strain was designated as pHP5. The constitution of plasmid pHP5 is shown in 
Fig. 10. 

[Example 7] Determination of the Base Sequence of the Keto Group-Introducing Enzyme Gene 

A Haematococcus p/uwa//s-derived 1.7 kb cDNA inserted into pPH5 was cut out with the restriction enzymes Sail 
and Xbal. This fragment was inserted into the Sall/Xbal site of both E. coli vector pBluescript II KS+ and E. coh vector 
pBluescript II SK+ to thereby obtain two plasmids (pHP5l and pHP52). Of these plasmids. the restriction imp of pHP51 
is shown in Fig 10. pHP51 and pHP52 are different in the direction of the above cDNA fragment inserted therein. In the 
former plasmid, the cDNA fragment undergoes the read-through of the lac promoter and in the latter the cDNA fragment 

d06S Using the obtained plasmids pHP51 and pHP52. deletion variants having various lengths of deletions were pre- 
pared by the following procedures and their base sequences were determined. pHP51 was digested with Sacl and 
Xbal and pHP52 with Kpnl and Sail. Then, phenol/chloroform extraction was carried out and the DNA was recovered 
by eihanol precipitation. Each DNA was dissolved in 100 pt of Exolll buffer (50 mM Tris-HCI. 100 mM NaCI. 5 mM 
MgCI, 10 mM 2-mercaptoethanol. pH 8.0) and. after the addition of 180 units of Exolll nuclease thereto, kept at 37 C. 
By sampling a 10 nl reaction solution in every 30 seconds, each sample was transfered into a tube containing 10 nl of 
MB buffer (40 mM NaCI. 2 mM ZnCI 2 . 10% glycerol. pH 4.5) located on ice. After the completion of the sampling, the 
10 tubes obtained was kept at 65°C for 10 minutes to deactivate enzymes. Then, 5 units of mung bean nuclease was 
added thereto and kept at 37»C for 30 minutes. After the completion of the reaction. 10 kinds of DNA fragments hav.ng 
varying degrees of deletions were recovered per one plasmid by agarose gel electrophoresis. The recovered DNAs 
were blunt-ended with Klenow enzyme and subjected to ligation reaction at 16 °C overnight, to thereby transform E. col, 
DH5a Plasmids were prepared for the resultant various clones, and sequence reactions were performed by using a flu- 
orescent primer cycle sequence kit manufactured by Applied Biosystems. Then, the base sequence of each plasmid 
was determined with an automatic sequencer. ^ im ^ .. 

The thus determined base sequence consisting of 1677 bp is shown in Figs. 4 and 5 (SEQ ID NO: 4). As a result 
of search for open reading frames. 3 open reading frames have been found which individually have a nbosome binding 
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site at the upstream of the initiation cod on that is necessary for the expression in E. coli These three frames are shown 
individually as A-D in Fig. 1 (SEQ ID NO: 5 in the sequence listing), as B-D in Fig. 2 (SEQ ID NO: 6) and as C-D in Fig. 
3 (SEQ ID NO: 7). As demonstrated in Example 8 infra, a shorter polypeptide than C-D loses the enzyme activity in E. 
coli , and thus it is considered that no initiation codon exists downstream of C. Therefore, the region locating down- 
5 stream of C in Fig. 3 was excluded from the search for open reading frames as described above. 

[Example 8] Determination of the Initiation Codon for the Keto Group-Introducing Enzyme Gene 

Fig. 1 1 shows the base sequence for an upstream portion of the open reading frames described above. There are 
w 5 potential initiation codon sites (base positions 168-170, 189-191, 264-266, 348-350 and 423-425; these sites are 
enclosed with boxes in Fig. 11). The bases at positions 168, 189 and 264 shown in the initiation oodons in Fig. 1 1 cor- 
respond to positions A in Fig. 1 , B in Fig. 2 and C in Fig. 3. respectively. In order to determine the necessary minimum 
region as a functional protein, deletion variants of pHP51 were prepared in substantially the same manner as in Exam- 
ple 5, to thereby obtain several plasmids wherein the upstream region was deleted. Fig. 1 1 shows the number of each 
is of these deletion plasmids and their upstream ends. These plasmids were individually introduced into the E. coli JM101 
carrying pACCAR16AcrtX as described in Example 6 and the pigments produced were identified. As a result, E. coli 
cells carrying deletion plasmids Nos. 30, 27, 31, 37 and 12 were observed to produce carrthaxanthin, but those cells 
carrying deletion plasmids Nos. 10, 6 and 38 were not observed to produce it. With respect to the deletion plasmid No. 
12 which lacks A of the initiation codon ATG at base positions 264-266, this ATG became GTG when a deletion variant 
20 was produced. Since E. coli can recognize even GTG as an initiation codon, it is considered that the synthesis of a pep- 
tide is starting from the initiation codon at this position. Therefore, it has become clear that a polypeptide chain encoded 
by the open reading frame starting from the initiation codon at positions 264-266 [C-D in Fig. 3 (as shown in SEQ ID 
NO: 7)] sufficiently exhibits the enzyme activity of keto group introduction. 

25 [Example 9] Identification of a Ketocarotenoid Pigment 

(1) Identification of carrthaxanthin 

p-carotene producing E. coli JM101 into which pHP5 or pHP51 has been introduced (E. coli pACCAR16AcrtX, 

30 pHP5 or pHP51) (presenting an orange color) was cultured in 2 liters of 2YT medium (1 .6% tryptone, 1% yeast extract, 
0.5% NaCI) containing 150 ng/ml ampicillin (Ap, Meiji Seika Kaisha), 30 fig/ml chloramphenicol (Cm, Sankyo. Co.), 1 
mM IPTG, 7 mg FeS0 4 • 7H 2 0 and 9.3 mg Na 2 • EDTA at 30°C for 24-30 hours. Cells harvested from the culture were 
extracted with 300 ml of acetone, and after concentration, extracted twice with 200 ml of chloroform/methanol (9/1) fol- 
lowed by concentration/drying/caking. The resultant material was dissolved in a small amount of chloroform/methanol 

35 (9/1) and subjected to thin layer chromatography (TLC) using a preparatory silica gel plate (Merck) and developing with 
chloroform/methanol (50/1). By means of this TLC, spots were separated into three with Rf values of 0.53, 0.78 and 1 . 
The most dark red pigment (of Rf value 0.53) representing 75% of the total pigments extracted was recovered from the 
TLC plate. This red pigment was further dissolved in a small amount of chloroform/methanol (9/1 ), applied to Sephadex 
LH-20 column chromatography (15 x 300 mm) and developed and eluted with chloroform/methanol (9/1) or chloro- 

40 form/methanol (1/1), to thereby obtain 2 mg of a pure pigment. All of the ultraviolet-visible spectrum, 1 H-NMR, FD-MS 
spectrum (m/e 564) and mobility on silica gel TLC [the Rf value was 0.53 when developed with chloroform/methanol 
(50/1)] of this substance agreed with those of a standard carrthaxanthin product (BASF), and thus this substance was 
identified as canthaxarrthin (for the structural formula, see Fig. 8). 

Further, a red pigment (having an Rf value of 0.78 on TLC) which represented 10% of the total pigments initially 

45 extracted was recovered from the TLC plate and dissolved in a small amount of methanol. In view of the ultraviolet-vis- 
ible spectrum, mobility on silica gel TLC [the Rf value was 0.78 when developed with chloroform/methanol (50/1)] and 
mobility on HPLC using Novapack HR6 \i C 18 (3.9 x 300 mm) (Waters) [RT was 16 minutes when developed with ace- 
tonitrile/methanol/2-propanol (90/6/4) at a flow rate of 1 .0 ml/min] of this pigment, it was believed to be echinenone (for 
the structural formula, see Fig. 8). 

so Then, a yellow pigment (having an Rf value of 1 on TLC) which represented the remaining 15% of the total pig- 
ments initially extracted was scraped from the TLC plate and dissolved in a small amount of methanol. Since the ultra- 
violet-visible spectrum and mobility on HPLC using Novapack HR6 jut C 18 (3.9 x 300 mm) (Waters) [RT was 62 minutes 
when developed with acetonitrile/methanol/2-propanol (90/6/4) at a flow rate of 1 .0 ml/mon] of this pigment agreed with 
those of a p-carotene standard product (all trans type, Sigma), this substance was found to be an unreacted p-carotene 

55 (for the structural formula, see Fig. 8). 



11 



EP 0 725 137 A1 



(2) Identification of astaxanthin and 4-ketozeaxanthin 

A zeaxanthin-produdng E. co// was prepared as follows. Briefly, plasmid pCAR25 having all of the carotenoid syn- 
thesis genes from Er. uredovora (Misawa. N.. Nakagawa. M., Kobayashi. K.. Yamano. S.. Izawa. Y. Nakamura, K. and 
Harashima K "Eluddation of the Erwinia uredovora carotenoid biosynthetic pathway by functional analysis of gene 
products expressed in Escherichia coir. J. Bacterid.. 172. pp. 670*6712. 1990; and Japanese Unexammed Patent 
Publication No. 3-58786) was subjected to BstEII digestion. Klenow fragment treatment and a Iigase reaction to thereby 
deactivate the crtX gene by a frameshift. Then, a 6.5 kb Asp718(KpnQ-EcoRI fragment was cut out which contains the 
crtE, crtB. crtl. crtY and crtZ genes necessary for zeaxanthin production. This fragment was inserted into the EcoRV 
site of E. co/; vector pACYC184 to thereby obtain the plasmid of interest (designated as pACCAFtt5AcrtX). 

The zeaxanthin-producing E. coli JM101 into which pHP5 or pHP51 has been introduced (E. coli pACCAR25Acrtx, 
pHP5 or pHP51) (presenting an orange color) was cultured in 2 liters of 2YT medium (1 .6% tryptone. 1% yeast extract. 
0.5% NaCI) containing 150 ng/rnl Ap. 30 Mj/ml Cm. 1 mM IPTG. 7 mg FeS0 4 • 7H z O and 9.3 mg Na* • EDTA rtJ 30 C 
for 24-30 hours. Cells harvested from the culture were extracted with 300 ml of acetone, and after concentration, 
extracted twice with 200 ml of chloroform/methanol (9/1) followed by concentration/drying/caking. The resultant mate- 
rial was dissolved in a small amount of chloroform/methanol (9/1) and subjected to thin layer chromatography (TLC) 
using a preparatory silica gel TLC plate (Merck) and developing with chloroform/methanol (1 5/1). By means of this TLC, 
the initial orange pigment was separated into 3 major spots with Rf values of 0.40. 0.54 and 0.72. These pigments were 
recovered from the TLC plate, dissolved separately in a small amount of chloroform/methanol (9/1). applied to Sepha- 
dex LH-20 column chromatography (15 x 300 mm) and developed and eluted with chloroform/methanol (9/1) or meth- 
anol to thereby obtain three pure pigments in amounts of about 1 mg, 1 mg and 2 mg. 

A pigment having an Rf value of 0.72 which represented about half of the total pigments extracted was found to 
have the same planar structure as that of astaxanthin in view of the results of its ultraviolet-visible spectrum, H-NMR 
and FD-MS spectrum (m/e 596). Then, this pigment was dissolved in diethyl etherS-propanolrethanol (5:52) and 
measured the CD spectrum. As a result, this substance was found to take a steric structure of 3S.3'S. Therefore, this 
substance was identified as astaxanthin (for the structural formula, see Fig. 8). Another pigment of Rf 0.54 was identi- 
fied as 4-ketozeaxanthin (for the structural formula, see Fig. 8) in view of the results of its ultrav.olet-vis.ble spectrum. 
1 H-NMR FD-MS spectrum (m/e 582) and mobility on silica gel TLC [the Rf value was 0.54 when developed with chlo- 
roform/methanol (15/1)]. With respect to the pigment having an Rf value of 0.40. its ultraviolet-visible spectrum mobility 
on silica gel TLC [the Rf value was 0.40 when developed with chloroform/methanol (50/1)] and mobility on HPLC using 
Novapack HR6 C 18 (3.9 x 300 mm) (Waters) [RT was 6.5 minutes when developed with acetonrtrile/methano 1/2-pro- 
panol (90/6/4) at a flow rate of 1 .0 ml/min] all agreed with those of a zeaxanthin standard product (BASF). Therefore, 
this substance was found to be an unreacted zeaxanthin (for the structural formula, see Fig. 8). 

From so far described, the functions of the keto group-introducing enzyme gene can be considered as Wtow. 
From (1) of Example 9, it is dear that the Haematococcus-derived keto group-introducing enzyme gene (bkt) is 
coding for a keto group-introducing enzyme (p-carotene ketolase) which catalyzes the conversion of p-carotene (a sub- 
strate) to canthaxanthin via echinenone (see Fig. 8). This shows that one enzyme, BKT. converts the methylene groups 
at positions 4 and 4' of a p-ionone ring directly to keto groups. No enzyme having such a function has been known so 
far In addition from (2) of Example 9. ft is dear that the Haematococcus-derived bkt gene is also coding for another 
keto group-introducing enzyme (zeaxanthin ketolase) which catalyzes the conversion of zeaxanthin (a substrate) to 
astaxanthin via 4-ketozeaxanthin (see Fig. 8). This shows that one enzyme. BKT. converts the methylene groups at 
positions 4 and 4' of 3- and ff-hydroxy-p-ionone rings directly to keto groups. No enzyme having such a function has 
been known so far neither. Accordingly, it can be said that the Haematococcus-derived keto group-introducing enzyme 
gene bkt is coding for an p-ionone or 3-hydroxy-p-ionone ring keto group-introducing enzyme (p-ionone or 3-hydroxy- 
p-ionone ring ketolase) which converts the methylene group at position 4 (4) to a keto group directly, regardless of 
whether a hydroxyl group is added to position 3 (31 Not limited in p-ionone rings or 3-hydroxy-p -lonone rings, there 
has been reported no finding so far that one enzyme converts a methylene group to a keto group directly. 

On the other hand, according to the researches of the present inventors using carotenoid synthesis genes from the 
bacteria Erwinia present in plants and the photosynthetic bacteria Rhodobacter. it has become clear that, generally, a 
carotenoid biosynthesis enzyme recognizes only one half of the carotenoid molecule which is a substrate and acts on 
it For example. crtY which is a lycopene ring formation enzyme gene recognizes by one half of the lycopene molecule 
and makes the ring formation. Therefore, by using the phytoene desaturase gene crtl from Rhodobacter, it » pmsfcte 
to allow E co//to produce neurosporene instead of lycopene. And when the produced neurosporene is treated with the 
Erofl/7/a-derived crtY. the crtY gene product recognizes only the half structure of a neurosporene molecule which is 
common with lycopene and. as a result, p-zeacarotene is produced which is drculized by half (see Linden H.. M'sawa. 
N Chamovitz D.. Pecker, I.. Hirschberg. J. and Sandmann. G.. "Functional complementation in Escherichia coli of dif- 
ferent phytoene desaturase genes and analysis of accumulated carotenes". Z. Naturforsch.. 46c. pp. 1 045-1 051 . 1 99i \y 
In addition, in the present invention also, when p-carotene was treated with BKT. first echinenone is synthesized 
wherein one keto group is introduced, and when zeaxanthin is treated with BKT, first 4-ketozeaxanth.n is synthesized 
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wherein one keto group is introduced. This can be considered that BKT recognizes one half of a substrate molecule and 
introduces a keto group at position 4. On the other hand, the E. coli carrying the E/vwn/a-derived crtE, crtB, crtl. crtY 
and crtZ genes produces zeaxanthin as described above, but p-cryptoxanthin wherein one hydroxyl group is introduced 
into p-carotene can also be detected in the products as an intermediary metabolite. This means that, if BKT is present 

5 there, 3'-hydroxyechinenone and 3-hydroxyechinenone can be produced with the p-cryptoxanthin as a substrate. In 
addition, it can be also considered that BKT further acts on these substances produced to thereby synthesize phoeni- 
coxanthin. This time, we have not achieved the identification of these substances in cultures, because under the condi- 
tions employed for this time it seems that these substances are present only in extremely small amounts. In fact, in the 
typical astaxanthin-producing microorganism Phaffia rhodozyma which is comparable with Haematococcus, 3-hydrox- 

w yechinenone and phoenicoxanthin are detected as intermediary metabolites of astaxanthin (Andrewes, A. G., Phaff, H. 
J. and Starr, M. P., "Carotenoids of Phaffia rhodozyma, a red-pigmented fermenting yeast", Phytochemistry, 15, pp. 
1003-1007, 1976). From so far described, it is possible to consider that there are the minor metabolic pathways shown 
in Fig. 9 other than the major astaxanthin metabolic pathway shown in Fig. 8. 

is [Example 1 0] Southern Analysis of the Genomic DNA of the other Green Algae Haematococcus 

It was examined as to whether some regions showing homology with bkt's isolated in the chromosomes of the other 
green algae Haematococcus. In the same manner as described in Example 2 for preparing the total DNA of Haemato- 
coccus pluvialis NIES-144, the total DNAs of Haematococcus lucustris UTEX 294 and Haematococcus lucustris C-392 

20 were prepared. The resultant DNAs together with the total DNA of H. pluvialis NIES-144 were digested with the restric- 
tion enzyme Hindlll, Pstl or Xbal and separated by agarose gel electrophoresis. The separated DNA fragments were 
denatured with an alkali solution of 0.5 N NaOH/1 .5 M NaCI, and then transferred to a nylon membrane overnight. The 
nylon membrane which had adsorbed DNA was soaked in a hybridization solution (6x Denhardt, 5xSSC. 0.2% SDS, 
100 jig/ml ssDNA) to carry out a prehybridization for 4 hours at 55 °C. Then, a 1.7 kb DNA fragment of bkt gene was 

25 labelled by using Megaprime™ DNA labelling system (Amersham) and [a- 32 P]dCTP (up to 1 10 TBq/mmol) and added 
to the prehybridization solution described above to thereby carry out a hybridization for 16 hours at 55 °C. After the 
hybridization, the reaction solution was washed with 2xSSC and 0.1% SDS at 60 °C for 1 hour and subjected to auto- 
radiography to detect signals indicating homology. As a result, with respect to Haematococcus pluvialis NIES-144, 
strong signals were obtained at positions 15kb, 10 kb and 1.9 kb in Hindlll digest, 6.1 kb, 3.3 kb, 2.8 kb, 2.3 kb. 2.0 kb, 

30 1 .4 kb and 0.8 kb in Pstl digest and 5.1 kb in Xbal digest. With respect to Haematococcus lucustris UTEX 294, strong 
signals were obtained at positions 1 5kb, 7.7 kb and 1 .9 kb in Hindlll digest, 10 kb, 5.0 kb, 4.0 kb, 3.4 kb, 2.9 kb, 1 .5 kb 
and 0.82 kb in Pstl digest and only at a position more than 20 kb in Xbal digest. With respect to Haematococcus lucus- 
tris C-392, strong signals were obtained at positions 15kb, 12 kb and 1.9 kb in Hindlll digest, 6.5 kb, 3.0 kb, 2.3 kb, 2.0 
kb, 1 .4 kb and 0.8 kb in Pstl digest and 5. 3 kb in Xbal digest (see Fig. 12). 

35 

INDUSTRIAL APPLICABILITY 

By introducing into a microorganism such as E. coli as a foreign gene the DNA of the invention coding for an 
enzyme which convert the methylene group at position 4 of p-ionone ring to a keto group and allowing the microorgan- 
40 ism to express the DNA, it has become possible to render a microorganism such as E. coli an ability to biosynthesize 
ketocarotenoids such as astaxanthin, 4-ketozeaxanthin, canthaxanthin, echinenone and other keto group-containing 
ketocarotenoids. By using the microorganism such as E. coli which has been rendered the ability to biosynthesize keto 
group-containing ketocarotenoids, it is possible to produce keto group-containing ketocarotenoids in large quantity with 
small labor and at low cost. 



50 
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SEQUENCE LISTING 



INFORMATION FOR SEQ ID NO: 1 

SEQUENCE CHARACTERISTICS: 

LENGTH: 320 amino acids 

TYPE: amino acids 

TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SOURCE: 

SPECIES: ttaematococcus pluvialis 
STRAIN: NIES-144 
SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Met His Val Ala Ser Ala Leu Met Val Glu Gin Lys Cly Ser Glu 
15 10 15 

Ala Ala Ala Ser Ser Pro Asp Val Leu Arg Ala Trp Ala Thr Gin 

20 25 30 

Tyr His Met Pro Ser Glu Ser Ser Asp Ala Ala Arg Pro Ala Leu 

35 40 45 

Lys His Ala Tyr Lys Pro Pro Ala Ser Asp Ala Lys Gly He Thr 

50 55 60 

Met Ala Leu Thr He He Gly Thr Trp Thr Ala Val Phe Leu His 

65 TO T5 

Ala lie Phe Gin He Arg Leu Pro Thr Ser Met Asp Gin Leu His 

80 85 90 

Trp Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu Gly Gly Ser 

95 100 105 

Ser Ser Leu Leu His He Ala Ala Val Phe lie Val Leu Glu Phe 

110 H5 120 
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Leu Tyr Thr 
Thr He Ala 
lie Cys He 
Arg Lys His 
Asp Pro Asp 
Ala Ser Phe 
Leu Ala Trp 
Ala Asn Leu 
Phe Arg Leu 
Pro Gly Pro 
Lys Thr Ser 
His Phe Asp 
Trp Trp Gin 
Val Pro Ala 



Gly Leu Phe 

125 
Leu Arg His 

140 
Ser Leu Tyr 

155 

Trp Glu His 
170 

Phe His Lys 

185 
Met Ser Ser 

200 
Trp Ala Val 

215 
Leu Val Phe 

230 
Phe Tyr Phe 

245 
Ala Ala Gly 

260 
Glu Ala Ser 

275 
Leu His Trp 

290 
Leu Pro His 

305 
Leu Ala 

320 



1 le Thr Thr 
Arg Gin Leu 
Ala Trp Phe 
His Asn His 
Gly Asn Pro 
Tyr Met Ser 
Val Met Gin 
Met Ala Ala 
Gly Thr Tyr 
Ser Gin Val 
Asp Val Met 
Glu His His 
Cys Arg Arg 



His Asp Ala 
130 

Asn Asp Leu 
145 

Asp Tyr Ser 
160 

Thr Gly Glu 
175 

Gly Leu Val 
190 

Leu Trp Gin 
205 

Met Leu Gly 
220 

Ala Pro He 
235 

Leu Pro His 
250 

Met Ala Trp 
265 

Ser Phe Leu 
280 

Arg Trp Pro 
295 

Leu Ser Gly 
310 



Met His Gly 
135 

Leu Gly Asn 
150 

Met Leu His 
165 

Val Gly Lys 
180 

Pro Trp Phe 
195 

Phe Ala Arg 
210 

Ala Pro Met 

r 

225 

Leu Ser Ala 
240 

Lys Pro Glu 

255 

Phe Arg Ala 
270 

Thr Cys Tyr 
285 

Phe Ala Pro 
300 

Arg Gly Leu 
315 
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INFORMATION FOR SEQ ID NO: 2 

SEQUENCE CHARACTERISTICS: 

LENGTH: 313 amino acids 

TYPE: amino acids 

TOPOLOGY: linear 
MOLECULAR TYPE: peptide 
SOURCE : 

SPECIES: Haematococcus pluvial is 
STRAIN: NIES-144 
SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Val Glu Gin Lys Gly Ser Glu Ala Ala Ala Ser Ser Pro Asp 
1 5 10 15. 

Val Leu Arg Ala Trp Ala Thr Gin Tyr His Met Pro Ser Glu Ser 

20 25 30 

Ser Asp Ala Ala Arg Pro Ala Leu Lys His Ala Tyr Lys Pro Pro 

35 40 45 

Ala Ser Asp Ala Lys Giy He Thr Met Ala Leu Thr He lie Gly 

50 55 60 

Thr Trp Thr Ala Val Phe Leu His Ala He Phe Gin He Arg Leu 

65 70 75 

Pro Thr Ser Met Asp Gin Leu His Trp Leu Pro Val Ser Glu Ala 

80 85 9Q 

Thr Ala Gin Leu Leu Gly Gly Ser Ser Ser Leu Leu His He Ala 

95 100 105 

Ala Val Phe lie Val Leu Glu Phe Leu Tyr Thr Gly Leu Phe He 

110 115 120 

Thr Thr His Asp Ala Met His Gly Thr He Ala Leu Arg His Arg 

125 130 135 
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Gin Leu Asn Asp Leu Leu Gly Asn lie Cys lie Ser Leu Tyr Ala 

140 145 150 

Trp Phe Asp Tyr Se.r Met Leu His Arg Lys His Trp Glu His His 

155 160 165 

Asn His Thr Gly Glu Val Gly Lys Asp Pro Asp Phe His Lys Gly 

170 175 180 

Asn Pro Gly Leu Val Pro Trp Phe Ala Ser Phe Met Ser Ser Tyr 

185 190 195 

Met Ser Leu Trp Gin Phe Ala Arg Leu Ala Trp Trp Ala Val Val 

200 205 210 

Met Gin Met Leu Gly Ala Pro Met Ala Asn Leu Leu Val Phe Met 

215 220 225 

Ala Ala Ala Pro He Leu Ser Ala Phe Arg Leu Phe Tyr Phe Gly 



Thr Tyr Leu Pro His Lys Pro Glu Pro Gly Pro Ala Ala Gly Ser 

245 250 255 

Gin Val Met Ala Trp Phe Arg Ala Lys Thr Ser Glu Ala Ser Asp 

260 265 270 

Val Met Ser Phe Leu Thr Cys Tyr His Phe Asp Leu His Trp Glu 

275 280 285 

His His Arg Trp Pro Phe Ala Pro Trp Trp Gin Leu Pro His Cys 

290 295 300 

Arg Arg Leu Ser Gly Arg Gly Leu Val Pro Ala Leu Ala 



230 



235 



240 



305 



310 



313 



INFORMATION 



FOR 



SEQ 



ID NO: 3 



SEQUENCE CHARACTERISTICS: 



LENGTH: 288 amino acids 
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TYPE: amino acids 

TOPOLOGY: linear 
MOLECULAR TYPE-: peptide 
SOURCE : 

SPECIES: Haematococcus pluvialis 
STRAIN: NIES-144 
SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Pro Ser Glu Ser Ser Asp Ala Ala Arg Pro Ala Leu Lys His 

15 10 15 

Ala Tyr Lys Pro Pro Ala Ser Asp Ala Lys Gly He Thr Met Ala 

20 25 30 

Leu Thr lie lie Gly Thr Trp Thr Ala Val Phe Leu His Ala He 

35 40 45 

Phe Gin He Arg Leu Pro Thr Ser Met Asp Gin Leu His Trp Leu 

50 55 60 

Pro Val Ser Glu Ala Thr Ala Gin Leu Leu Gly Gly Ser Ser Ser 

65 70 75 

Leu Leu His He Ala Ala Val Phe He Val Leu Glu Phe Leu Tyr 

80 85 90 

Thr Gly Leu Phe He Thr Thr His Asp Ala Met His Gly Thr He 

95 100 105 

Ala Leu Arg His Arg Gin Leu Asn Asp Leu Leu Gly Asn He Cys 

110 115 120 

He Ser Leu Tyr Ala Trp Phe Asp Tyr Ser Met Leu His Arg Lys 

125 130 135 

His Trp Glu His His Asn His Thr Gly Glu Val Gly Lys Asp Pro 

140 145 150 

Asp Phe His Lys Gly Asn Pro Gly Leu Val Pro Trp Phe Ala Ser 

155 160 165 
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Phe Met Ser Ser Tyr Met Ser Leu Trp Gin Phe Ala Arg Leu Ala 

170 175 180 

Trp Trp Ala Val Va,l Met Gin Met Leu Gly Ala Pro Met Ala Asn 

185 190 195 

Leu Leu Val Phe Met Ala Ala Ala Pro He Leu Ser Ala Phe Arg 

200 205 210 

Leu Phe Tyr Phe Gly Thr Tyr Leu Pro His Lys Pro Glu Pro Gly 

215 220 225 

Pro Ala Ala Gly Ser Gin Val Met Ala Trp Phe Arg Ala Lys Thr 

230 235 240 

Ser Glu Ala Ser Asp Val Met Ser Phe Leu Thr Cys Tyr His Phe 

245 250 255 

Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala Pro Trp Trp ( 

260 265 270 

Gin Leu Pro His Cys Arg Arg Leu Ser Gly Arg Gly Leu Val Pro 

275 280 285 

Ala Leu Ala 
288 

INFORMATION FOR SEQ ID NO: 4 



SEQUENCE CHARACTERISTICS: 

LENGTH: 1677 base pairs 

TYPE: nucleic acids 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULAR TYPE: CDNA 
SOURCE: 

SPECIES: H aematococcus pluvialis 
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STRAIN: NIES-144 
SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CGGGGCAACT CAAGAAA.TTC AACAGCTGCA AGCGCGCCCC AGCCTCACAG CGCCAAGTGA 
GCTATCGACG TGGTTGTGAG CGCTCGACGT GGTCCACTGA CGGGCCTGTG AGCCTCTGCG 
CTCCGTCCTC TGCCAAATCT CGCGTCGGGG CCTGCCTAAG TCGAAGAATG CAC GTC 

Met His Val 
1 

GCA TCG GCA CTA ATG GTC GAG CAG AAA GGC AGT GAG GCA GCT GCT TCC. 
Ala Ser Ala Leu Met Val Glu Gin Lys Gly Ser Glu Ala Ala Ala Ser 

5 10 15 

AGC CCA GAC GTC TTG AGA GCG TGG GCG ACA CAG TAT CAC ATG CCA TCC 
Ser Pro Asp Val Leu Arg Ala Trp Ala Thr Gin Tyr His Met Pro Ser 
20 25 30 35 

GAG TCG TCA GAC GCA GCT CGT CCT GCG CTA AAG CAC GCC TAC AAA CCT 
Glu Ser Ser Asp Ala Ala Arg Pro Ala Leu Lys His Ala Tyr Lys Pro 

40 45 50 

CCA GCA TCT GAC GCC AAG GGC ATC ACG ATG GCG CTG ACC ATC ATT GGC 
Pro Ala Ser Asp Ala Lys Gly He Thr Met Ala Leu Thr He He Gly 

55 60 65 

ACC TGG ACC GCA GTG TTT TTA CAC GCA ATA TTT CAA ATC AGG CTA CCG ' 
Thr Trp Thr Ala Val Phe Leu His Ala He Phe Gin He Arg Leu Pro 

70 75 80 

ACA TCC ATG GAC CAG CTT CAC TGG TTG CCT GTG TCC GAA GCC ACA; GCC 
Thr Ser Met Asp Gin Leu His Trp Leu Pro Val Ser Glu Ala Thr Ala 

85 90 95 

CAG CTT TTG GGC GGA AGC AGC AGC CTA CTG CAC ATC GCT GCA GTC TTC 
Gin Leu Leu Gly Gly Ser Ser Ser Leu Leu His He Ala Ala Val Phe 
100 105 110 115 

ATT GTA CTT GAG TTC CTG TAC ACT GGT CTA TTC ATC ACC ACA CAT GAC 
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15 



20 



25 



30 



35 



40 



45 



50 



lie Val Leu Glu Phe 
120 
GGC ACC 
Gly Thr 
135 

ATC TGC 
He Cys 



GCA ATG CAT 
Ala Met His 



CTT GGC AAC 
Leu Gly Asn 
150 

CTG CAT CGC 
Leu His Arg 
165 

AAA GAC CCT 
Lys Asp Pro 
180 

GCC AGC TTC 
Ala Ser Phe 

GCA TGG TGG 
Ala Trp Trp 

CTC CTA GTC 
Leu Leu Val 
230 

TTC TAC TTC 
Phe Tyr Phe 

245 
GCA GGC TCT 
Ala Gly Ser 
260 



Leu Tyr Thr Gly Leu Phe 
125 

ATA GCT TTG AGG CAC AGG 

lie Ala Leu Arg His Arg 
140 

TAC GCC TGG 

Tyr Ala Trp 



He Thr Thr His Asp 
130 

GAT CTC 

Asp Leu 



AAG CAC 
Lys His 

GAC TTC 
Asp Phe 

ATG TCC 
Met Ser 
200 
GCA GTG 
Ala Vai 
215 

TTC ATG 
Phe Met 

GGC ACT 
Gly Thr 

CAG GTG 
Gin Val 



ATA TCA CTG 
I le Ser Leu 
155 

TGG GAG CAC 
Trp Glu His 
170 

CAC AAG GGA 
His Lys Gly 
185 

AGC TAC ATG 
Ser Tyr Met 

GTG ATG CAA 
Val Met Gin 

GCT GCA GCC 
Ala Ala Ala 
235 

TAC CTG CCA 
Tyr Leu Pro 

250 
ATG GCC TGG 
Met Ala Trp 
265 



CAC AAC CAT 
His Asn His 

AAT CCC GGC 
Asn Pro Gly 
190 

TCC CTG TGG 
Ser Leu Trp 

205 
ATG CTG GGG 
Met Leu Gly 
220 

CCA ATC TTG 
Pro I le Leu 

CAC AAG CCT 
His Lys Pro 

TTC AGG GCC 
Phe Arg Ala 
270 



CAG CTC AAT 
Gin Leu Asn 
145 

TTT GAC TAC 
Phe Asp Tyr 

160 
ACT GGC GAA 
Thr Gly Glu 
175 

CTT GTC CCC 
Leu Val Pro 

CAG TTT GCC 
Gin Phe Ala 

GCG CCC ATG 
Ala Pro Met 
225 

TCA GCA TTC 
Ser Ala Phe 

240 
GAG CCA GGC 
Glu Pro Gly 
255 

AAG ACA ACT 
Lys Thr Ser 



AGC ATG 
Ser Met 

GTG GGG 
Val Gly 

TGG TTC 
Trp Phe 
195 
CGG CTG 
Arg Leu 
210 

GCA AAT 
Ala Asn 

CGC CTC 
Arg Leu 

CCT GCA 
Pro Ala 

GAG GCA 
Glu Ala 
275 



608 



656 



704 



752 



800 



848 



896 



944 



992 



55 
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TCT GAT GTG ATG AGT TTC CTG ACA TGC TAC CAC TTT GAC CTG CAC TGG 1040 

Ser Asp Val Met Ser Phe Leu Thr Cys Tyr His Phe Asp Leu His Trp 

2«> 285 290 

GAG CAC CAC AGG TGG CCC TTT GCC CCC TGG TGG CAG CTG CCC CAC TGC 1088 

Glu His His Arg Trp Pro Phe Ala Pro Trp Trp Gin Leu Pro His Cys 

295 300 305 



CGC CGC CTG TCC GGG CGT GGC CTG GTG CCT GCC TTG GCA TGACCTGGTC 1137 
Arg Arg Leu Ser Gly Arg Gly Leu Val Pro Ala Leu Ala 



310 


315 


320 




CCTCCGCTGG 


TGACCCAGCG 


TCTGCACAAG 


AGTGTCATGC TACAGGGTGC TGCGGCCAGT 


1197 


GGCAGCGCAG 


TGCACTCTCA 


GCCTGTATGG 


GGCTACCGCT GTGCCACTGA GCACTGGGCA 


1257 


TGCCACTGAG 


CACTGGGCGT 


GCTACTGAGC 


AATGGGCGTG CTACTGAGCA ATGGGCGTGC 


1317 


TACTGACAAT 


GGGCGTGCTA 


CTGGGGTCTG 


GCAGTGGCTA GGATGGAGTT TGAT.GCATTC 


1377 


AGTAGCGGTG 


GCCAACGTCA TGTGGATGGT 


GGAAGTGCTG AGGGGTTTAG GCAGCCGGCA 


1437 


TTTGAGAGGG 


CTAAGTTATA 


AATCGCATGC 


TGCTCATGCG CACATATCTG CACACAGCCA 


1497 


GGGAAATCCC 


TTCGAGAGTG 


ATTATGGGAC 


ACTTGTATTG GTTTCGTGCT ATTGTTTTAT 


1557 


TCAGCAGCAG 


TACTTAGTGA 


GGGTGAGAGC 


AGGGTGGTGA GAGTGGAGTG AGTGAGTATG 


1617 


AACCTGGTCA 


GCGAGGTGAA 


CAGCCTGTAA 


TGAATGACTC TGTCTAAAAA AAAAAAAAAA 


1677 



INFORMATION FOR SEQ ID NO: 5 

SEQUENCE CHARACTERISTICS: 

LENGTH: 963 base pairs 

TYPE: nucleic acids 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULAR TYPE: cDNA 
SOURCE : 

SPECIES: Haematococcus pluvialis 
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STRAIN: NIES-144 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ATG CAC CTC GCA TCJ9 GCA CTA ATG GTC GAG CAG AAA GGC ACT GAG 
Met His Val Ala Ser Ala Leu Met Val Glu Gin Lys Gly Ser Glu 
1 5 10 15 

GCA GCT GCT TCC AGC CCA GAC GTC TTG AGA GCG TGG GCG ACA CAG 
Ala Ala Ala Ser Ser Pro Asp Val Leu Arg Ala Trp Ala Thr Gin 

20 25 30 

TAT CAC ATG CCA TCC GAG TCG TCA GAC GCA GCT CGT CCT GCG CTA 
Tyr His Met Pro Ser Glu Ser Ser Asp Ala Ala Arg Pro Ala Leu 

35 40 45 

AAG CAC GCC TAC AAA CCT CCA GCA TCT GAC GCC AAG GGC ATC ACG 
Lys His Ala Tyr Lys Pro Pro Ala Ser Asp Ala Lys Gly He Thr 

50 55 60 

ATG GCG CTG ACC ATC ATT GGC ACC TGG ACC GCA GTG TTT TTA CAC 
Met Ala Leu Thr He He Gly Thr Trp Thr Ala Val Phe Leu His 

65 70 75 

GCA ATA TTT CAA ATC AGG CTA CCG ACA TCC ATG GAC CAG CTT CAC 
Ala He Phe Gin He Arg Leu Pro Thr Ser Met Asp Gin Leu His 

80 85 90 

TGG TTG CCT GTG TCC GAA GCC ACA GCC CAG CTT TTG GGC GGA AGC 
Trp Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu Gly Gly Ser 

95 100 105 

AGC AGC CTA CTG CAC ATC GCT GCA GTC TTC ATT GTA CTT GAG TTC 
Ser Ser Leu Leu His He Ala Ala Val Phe lie Val Leu Glu Phe 
110 115 120 

CTG TAC ACT GGT CTA TTC ATC ACC ACA CAT GAC GCA ATG CAT GGC 
Leu Tyr Thr Gly Leu Phe He Thr Thr His Asp Ala Met His Gly 
125 130 135 
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ACC ATA GCT TTG AGG CAC AGG CAG CTC AAT GAT CTC CTT GGC AAC 450 
Thr lie Ala Leu Arg His Arg Gin Leu Asn Asp Leu Leu Gly Asn 
140 145 150 

ATC TGC ATA TCA CTG TAC GCC TGG TTT GAC TAC ACC ATG CTG CAT 495 
lie Cys He Ser Leu Tyr Ala Trp Phe Asp Tyr Ser Met Leu His 
155 160 165 

CGC AAG CAC TGG GAG CAC CAC AAC CAT ACT GGC GAA GTG GGG AAA 540 
Arg Lys His Trp Glu His His Asn His Thr Gly Glu Val Gly Lys 
170 175 180 

GAC CCT GAC TTC CAC AAG GGA AAT CCC GGC CTT GTC CCC TGG TTC 585 
Asp Pro Asp Phe His Lys Gly Asn Pro Gly Leu Val Pro Trp Phe 
185 190 195 

GCC AGC TTC ATG TCC AGC TAC ATC TCC CTG TGG CAG TTT GCC CGG 630 
Ala Ser Phe Met Ser Ser Tyr Met Ser Leu Trp Gin Phe Ala Arg 
200 205 210 

CTG GCA TGG TGG GCA GTG GTG ATG CAA ATG CTG GGG GCG CCC ATG ■ 675 
Leu Ala Trp Trp Ala Val Val Met Gin Met Leu Gly Ala Pro Met 
215 220 225 

GCA AAT CTC CTA GTC TTC ATG GCT GCA GCC CCA ATC TTG TCA GCA 720 
Ala Asn Leu Leu Val Phe Met Ala Ala Ala Pro He Leu Ser Ala 
230 235 240 

TTC CGC CTC TTC TAC TTC GGC ACT TAC CTC CCA CAC AAG CCT GAG 765 
Phe Arg Leu Phe Tyr Phe Gly Thr Tyr Leu Pro His Lys Pro Glu ; 

245 250 255 

CCA GGC CCT GCA GCA GGC TCT CAG GTG ATG GCC TGG TTC AGG GCC 810 
Pro Gly Pro Ala Ala Gly Ser Gin Val Met Ala Trp Phe Arg Ala 
260 265 270 

AAG ACA AGT GAG GCA TCT GAT GTG ATG AGT TTC CTG ACA TGC TAC 855 
Lys Thr Ser Glu Ala Ser Asp Val Met Ser Phe Leu Thr Cys Tyr 
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275 280 285 

CAC TTT GAC CTC CAC TGG GAG CAC CAC AGG TGG CCC TTT GCC CCC 
His Phe Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala Pro 

290 295 300 

TGG TGG CAG CTG CCC CAC TGC CGC CGC CTG TCC GGG CGT GGC CTG 
Trp Trp Gin Leu Pro His Cys Arg Arg Leu Ser Gly Arg Gly Leu 

305 310 315 

GTG CCT GCC TTG GCA TGA 963 
Val Pro Ala Leu Ala 

320 

INFORMATION FOR SEQ ID NO: 6 

i 

SEQUENCE CHARACTERISTICS: 

LENGTH: 942 base pairs 

TYPE: nucleic acids 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULAR TYPE: CDNA 
SOURCE: 

SPECIES: Haematococcas pluvialis 
STRAIN: NIES-144 
SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
ATG GTC GAG CAG AAA GGC AGT GAG GCA GCT GCT TCC AGC CCA GAC 
Met Val Glu Gin Lys Gly Ser Glu Ala Ala Ala Ser Ser Pro Asp 
15 10 15 

GTC TTG AGA GCG TGG GCG ACA CAG TAT CAC ATG CCA TCC GAG TCG 
Val Leu Arg Ala Trp Ala Thr Gin Tyr His Met Pro Ser Glu Ser 
20 25 30 
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TCA GAC 
Ser Asp 

GCA TCT 
Ala Ser 

ACC TGG 
Thr Trp 

CCG ACA 
Pro Thr 

ACA GCC 
Thr Ala 

GCA GTC 
Ala Val 

ACC ACA 
Thr Thr 

CAG CTC 
Gin Leu 

TGG TTT 
Trp Phe 

AAC CAT 
Asn His 



GCA GCT CGT 
Ala Ala Arg 
3p 

GAC GCC AAG 
Asp Ala Lys 
50 

ACC GCA GTG 
Thr Ala Val 
65 

TCC ATG GAC 
Ser Met Asp 
80 

CAG CTT TTG 
Gin Leu Leu 
95 

TTC ATT GTA 
Phe He Val 
110 

CAT GAC GCA 
His Asp Ala 
125 

AAT GAT CTC 
Asn Asp Leu 
140 

GAC TAC AGC 
Asp Tyr Ser 
155 

ACT GGC GAA 
Thr Gly Glu 



CCT GCG 
Pro Ala 

GGC ATC 
Gly He 

TTT TTA 
Phe Leu 

CAG CTT 
Gin Leu 

GGC GGA 
Gly Gly 

CTT GAG 
Leu Glu 

ATG CAT 
Met His 

CTT GGC 
Leu Gly 

ATG CTG 
Met Leu 

GTG GGG 
Val Gly 



CTA AAG 
Leu Lys 

ACG ATG 
Thr Met 

CAC GCA 
His Ala 

CAC TGG 
His Trp 

AGC AGC 
Ser Ser 

TTC CTG 
Phe Leu 

GGC ACC 
Gly Thr 

AAC ATC 
Asn He 

CAT CGC 
His Arg 

AAA GAC 
Lys Asp 



CAC GCC TAC 
His Ala Tyr 
40 

GCG CTG ACC 
Ala Leu Thr 
55 

ATA TTT CAA 
He Phe Gin 
70 

TTG CCT GTG 
Leu Pro Val 
85 

AGC CTA CTG 
Ser Leu Leu 
100 

TAC ACT GGT 
Tyr Thr Gly 
115 

ATA GCT TTG 
He Ala Leu 
130 

TGC ATA TCA 
Cys I le Ser 
145 

AAG CAC TGG 
Lys His Trp 
160 

CCT GAC TTC 
Pro Asp Phe 



AAA CCT CCA 
Lys Pro Pro 
45 

ATC ATT GGC 
lie He Gly 
60 

ATC AGG CTA 
I le Arg Leu 
75 

TCC GAA GCC 
Ser Glu Ala 
90 

CAC ATC GCT 
His He Ala 
105 

CTA TTC ATC 
Leu Phe He 
120 

AGG CAC AGG 
Arg His Arg 
135 

CTG TAC GCC 
Leu Tyr Ala 
150 

GAG CAC CAC 
Glu His His 
165 

CAC AAG GGA 
His Lys Gly 
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170 175 180 

AAT CCC GCC CTT GTC CCC TGG TTC GCC AGC TTC ATG TCC AGC TAC 585 
Asn Pro Gly Leu VaJ Pro Trp Phe Ala Ser Phe Met Ser Ser Tyr 

185 190 195 

ATG TCC CTG TGG CAG TTT GCC CGG CTG GCA TGG TGG GCA GTG GTG 630 
Met Ser Leu Trp Gin Phe Ala Arg Leu Ala Trp Trp Ala Val Val 

200 205 210 

ATG CAA ATG CTG GGG GCG CCC ATG GCA AAT CTC CTA GTC TTC ATG 675 
Met Gin Met Leu Gly Ala Pro Met Ala Asn Leu Leu Val Phe Met 

215 220 225 

GCT GCA GCC CCA ATC TTG TCA GCA TTC CGC CTC TTC TAC TTC GGC 720 
Ala Ala Ala Pro lie Leu Ser Ala Phe Arg Leu Phe Tyr Phe Gly 

230 235 240 

ACT TAC CTG CCA CAC AAG CCT GAG CCA GGC CCT GCA GCA GGC TCT 765 
Thr Tyr Leu Pro His Lys Pro Glu Pro Gly Pro Ala Ala Gly Ser 

245 250 255 . 

CAG GTG ATG GCC TGG TTC AGG GCC AAG ACA ACT GAG GCA TCT GAT 810 
Gin Val Met Ala Trp Phe Arg Ala Lys Thr Ser Glu Ala Ser Asp 

260 265 270 

GTG ATG AGT TTC CTG ACA TGC TAC CAC TTT GAC CTG CAC TGG GAG 855 
Val Met Ser Phe Leu Thr Cys Tyr His Phe Asp Leu His Trp Glu 

275 280 285 

CAC CAC AGG TGG CCC TTT GCC CCC TGG TGG CAG CTG CCC CAC TGC 900 
His His Arg Trp Pro Phe Ala Pro Trp Trp Gin Leu Pro His Cys 

290 295 300 

CGC CGC CTG TCC GGG CGT GGC CTG GTG CCT GCC TTG GCA TGA 942 
Arg Arg Leu Ser Gly Arg Gly Leu Val Pro Ala Leu Ala 

305 310 313 
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INFORMATION FOR SEQ ID NO: 7 

SEQUENCE CHARACTERISTICS: 

LENGTH: 867 base pairs 

TYPE: nucleic acids 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULAR TYPE: cDNA 
SOURCE: 

SPECIES: Haematococcus pluvialis 
STRAIN: NIES-144 
SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG CCA TCC GAG TCG TCA GAC GCA GCT CGT CCT GCG CTA AAG CAC 

Met Pro Ser Glu Ser Ser Asp Ala Ala Arg Pro Ala Leu Lys His 

1 5 10 15 

GCC TAC AAA CCT CCA GCA TCT GAC GCC AAG GGC ATC ACG ATG GCG 

Ala Tyr Lys Pro Pro Ala Ser Asp Ala Lys Gly He Thr Met Ala 

20 25 30 

CTG ACC ATC ATT GGC ACC TGG ACC GCA GTG TTT TTA CAC GCA ATA 

Leu Thr He lie Gly Thr Trp Thr Ala Val Phe Leu His Ala He 

35 40 45 

TTT CAA ATC AGG CTA CCG ACA TCC ATG GAC CAG CTT CAC TGG TTG 

Phe Gin He Arg Leu Pro Thr Ser Met Asp Gin Leu His Trp Leu 

50 55 60 

CCT GTG TCC GAA GCC ACA GCC CAG CTT TTG GGC GGA AGC AGC AGC 

Pro Val Ser Glu Ala Thr Ala Gin Leu Leu Gly Gly Ser Ser Ser 

65 70 75 
CTA CTG CAC ATC GCT GCA GTC TTC ATT GTA CTT GAG TTC CTG TAC 
Leu Leu His He Ala Ala Val Phe He Val Leu Glu Phe Leu Tyr 
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80 85 90 

ACT GGT CTA TTC ATC ACC ACA CAT GAC GCA ATG CAT GGC ACC ATA 315 

Thr Gly Leu Phe lie Thr Thr His Asp Ala Met His Gly Thr lie 

95 100 - 105 

GCT TTG AGG CAC AGG CAG CTC AAT GAT CTC CTT GGC AAC ATC TGC 360 

Ala Leu Arg His Arg Gin Leu Asn Asp Leu Leu Gly Asn lie Cys 

110 115 120 

ATA TCA CTG TAC GCC TGG TTT GAC TAC AGC ATG CTG CAT CGC AAG 405 

He Ser Leu Tyr Ala Trp Phe Asp Tyr Ser Met Leu His Arg Lys 

125 130 135 

CAC TGG GAG CAC CAC AAC CAT ACT GGC GAA GTG GGG AAA GAC CCT 450 

His Trp Glu His His Asn His Thr Gly Glu Val Gly Lys Asp Pro 

140 145 150 

GAC TTC CAC AAG GGA AAT CCC GGC CTT GTC CCC TGG TTC GCC AGC* 495 

Asp Phe His Lys Gly Asn Pro Ciy Leu Val Pro Trp Phe Ala Ser 

155 160 165 

TTC ATG TCC AGC TAC ATG TCC CTG TGG CAG TTT GCC CGG CTG GCA 540 

Phe Met Ser Ser Tyr Met Ser Leu Trp Gin Phe Ala Arg Leu Ala 

170 175 180 

TGG TGG GCA GTG GTG ATG CAA ATG CTG GGG GCG CCC ATG GCA AAT - 585 

Trp Trp Ala Val Val Met Gin Met Leu Gly Ala Pro Met Ala Asn 

185 190 195 

CTC CTA GTC TTC ATG GCT GCA GCC CCA ATC TTG TCA GCA TTC CGC 630 

Leu Leu Vat Phe Met Ala Ala Ala Pro He Leu Ser Ala Phe Arg 

200 205 210 

CTC TTC TAC TTC GGC ACT TAC CTG CCA CAC AAG CCT GAG CCA GGC 675 

Leu Phe Tyr Phe Gly Thr Tyr Leu Pro His Lys Pro Glu Pro Gly 

215 220 225 

CCT GCA GCA GGC TCT CAG GTG ATG GCC TGG TTC AGG GCC AAG ACA 720 
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Pro Ala Ala Gly Ser Gin Val Met Ala Trp Phe Arg Ala Lys Thr 



230 



235 



240 



ACT GAG GCA TCT GAT GTG ATG AGT TTC CTG ACA TGC TAC CAC TTT 
Ser Glu Ala Ser Asp Val Met Ser Phe Leu Thr Cys Tyr His Phe 



245 



250 



255 



GAC CTG CAC TGG GAG CAC CAC AGG TGG CCC TTT GCC CCC TGG TGG 
Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala Pro Trp Trp 



260 



265 



270 



CAG CTG CCC CAC TGC CGC CGC CTG TCC GGG CGT GGC CTG GTG CCT 
Gin Leu Pro His Cys Arg Arg Leu Ser Gly Arg Gly Leu Val Pro 



275 

GCC TTG GCA TGA 867 
Ala Leu Ala 



280 



285 



765 



810 



855 



Claims 

1 . A polypeptide having an enzyme activity to convert the methylene group at position 4 of a p-ionone ring in a com- 
pound containing p-ionone rings to a keto group. 

2. The polypeptide of claim 1 . wherein said polypeptide substantially comprises the amino acid sequence as shown 
in SEQ ID NO: 1 of the sequence listing. 

3. The polypeptide of claim 1 , wherein said polypeptide substantially comprises the amino acid sequence as shown 
in SEQ ID NO: 2 of the sequence listing. 

4. The polypeptide of claim 1 . wherein said polypeptide substantially comprises the amino acid sequence as shown 
in SEQ ID NO: 3 of the sequence listing. 

5. The polypeptide of any one of claims 1 to 4, wherein said compound containing p-ionone rings is p-carotene. 

6. The polypeptide of any one of claims 1 to 4, wherein one hydrogen atom at position 3 of said p-ionone ring may be 
replaced with a hydroxyl group. 

7. The polypeptide of claim 6, wherein the compound containing said p-ionone ring where a hydrogen atom at position 
3 is replaced with a hydroxyl group is zeaxanthin. 

8. A DNA comprising a base sequence coding for a polypeptide having an enzyme activity to convert the methylene 
group at position 4 of a p-ionone ring in a compound containing p-ionone rings to a keto group. 

9. The DNA of claim 8. wherein said DNA comprises a base sequence coding for a polypeptide which has an enzyme 
activity to convert the melhylene group at position 4 of a p-ionone ring in a compound containing p-ionone rings to 
a keto group and which polypeptide substantially comprises the amino acid sequence as shown in SEQ ID NO: 1 
of the sequence listing. 
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10. The DNA of claim 9, wherein said base sequence coding for the polypeptide comprising the amino acid sequence 
as shown in SEQ ID NO: 1 of the sequence listing is the base sequence as shown in SEQ ID NO: 4 of the sequence 
listing. 

5 11. The DNA of claim 9, wherein said base sequence coding for the polypeptide comprising the amino acid sequence 
as shown in SEQ ID NO: 1 of the sequence listing is the base sequence as shown in SEQ ID NO: 5 of the sequence 
listing. 

12. The DNA of claim 8, wherein said DNA comprises a base sequence coding for a polypeptide which has an enzyme 
10 activity to convert the methylene group at position 4 of a p-ionone ring in a compound containing p-ionone rings to 

a keto group and which polypeptide substantially comprises the amino acid sequence as shown in SEQ ID NO: 2 
of the sequence listing. 

1 3. The DNA of claim 12, wherein said base sequence coding for the polypeptide comprising the amino acid sequence 
15 as shown in SEQ ID NO:2 of the sequence listing is the base sequence as shown in SEQ ID NO: 6 of the sequence 

listing. 

14. The DNA of claim 8, wherein said DNA comprises a base sequence coding for a polypeptide which has an enzyme 
activity to convert the methylene group at position 4 of a p-ionone ring in a compound containing p-ionone rings to 

20 a keto group and which polypeptide substantially comprises the amino acid sequence as shown in SEQ ID NO:3 
of the sequence listing. 

15. The DNA of claim 1 4, wherein said base sequence coding for the polypeptide comprising the amino acid sequence 
as shown in SEQ ID NO:3 of the sequence listing is the base sequence as shown in SEQ ID NO: 7 of the sequence 

25 listing. 

16. The DNA of any one of claims 8 to 15, wherein said compound containing p-ionone rings is p-carotene. 

1 7. The DNA of any one of claims 8 to 1 5, wherein a hydrogen atom at position 3 of said p-ionone ring may be replaced 
30 with a hydroxyl group. 

18. The DNA of claim 1 7, wherein the compound containing said p-ionone ring where a hydrogen atom at position 3 is 
replaced with a hydroxyl group is zeaxanthin. 

35 19. A DNA which hybridizes with the DNA of any one of claims 8 to 18, and which comprises a base sequence coding 
for a polypeptide having an enzyme activity to convert the methylene group at position 4 of a p-ionone ring in a com- 
pound containing p-ionone rings to a keto group. 

20. A DNA inserted in plasmid pHP51 , which comprises a base sequence coding for a polypeptide having an enzyme 
40 activity to convert the methylene group at position 4 of a p-ionone ring in a compound containing p-ionone rings to 

a keto group. 

21 . A recombinant vector comprising the DNA of claim 8, 19 or 20. 

45 22. A microorganism into which the DNA of claim 8, 19 or 20 has been introduced. 

23. A method for producing a ketocarotenoid, comprising culturing the microorganism of claim 22 in a medium and col- 
lecting a ketocarotenoid from the culture. 

so 24. The method of claim 23, wherein said ketocarotenoid is at least one compound selected from the group consisting 
of echinenone and canthaxanthin. 

25. The method of daim 23, wherein said ketocarotenoid is at least one compound selected from the group consisting 
of 4-ketozeaxanthin and astaxanthin. 

55 

26. The method of claim 23, wherein said microorganism of claim 22 is a bacterium or a yeast. 



31 



EP 0 725 137 A1 



FIG. 1 



f 176 185 194 203 212 221 

ATG CAC GTC GCA TCG GCA CTA ATG GTC GAG CAG AAA GGC AGT GAG GCA GCT GCT 
Met His Val Ala Ser Ala Leu Met Val Glu Gin Lys Gly Ser Glu Ala Ala Ala 
230 239 248 257 266 275 

TCC AGC CCA GAC GTC TTG AGA GCG TGG GCG ACA CAG TAT CAC ATG CCA TCC GAG 
Ser Ser Pro Asp Val Leu Arg Ala Trp Ala Thr Gin Tyr His Met Pro Ser Glu 
284 293 302 311 320 329 

TCG TCA GAC GCA GCT CGT CCT GCG CTA AAG CAC GCC TAC AAA CCT CCA GCA TCT 
Ser Ser Asp Ala Ala Arg Pro Ala Leu Lys His Ala Tyr Lys Pro Pro Ala ser 
338 347 356 365 374 383 

GAC GCC AAG GGC ATC ACG ATG GCG CTG ACC ATC ATT GGC ACC TGG ACC GCA GTG 
Asp Ala Lys Gly lie Thr Met Ala Leu Thr He He Gly Thr Trp Thr Ala Val 
392 401 410 419 428 437 

TTT TTA CAC GCA ATA TTT CAA ATC AGG CTA CCG ACA TCC ATG GAC CAG CTT CAC 
*he Leu His Ala He Phe Gin He Arg Leu Pro Thr Ser Met As? Gin Leu His 
446 455 464 473 482 491 

TGG TTG CCT GTG TCC GAA GCC ACA GCC CAG CTT TTG GGC GGA AGC AGC AGC CTA 
Tro Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu Gly Gly Ser Ser Ser Leu 
500 509 518 527 536 =>4:> 

CTG CAC ATC GCT GCA GTC TTC ATT GTA CTT GAG TTC CTG TAC ACT GGT CTA TTC 
Leu His He Ala Ala Val Phe He Val Leu Glu Phe Leu Tyr Thr Gly Leu Phe 
554 563 572 581 590 599 

ATC ACC ACA CAT GAC GCA ATG CAT GGC ACC ATA GCT TTG AGG CAC AGG CAG CTC 
He Thr Thr His Aso Ala Met His Gly Thr He Ala Leu Arg His Arg Gin Leu 
608 617 626 635 644 6d3 

AAT GAT CTC CTT GGC AAC ATC TGC ATA TCA CTG TAC GCC TGG TTT GAC TAC AGC 
Asn Asp Leu Leu Gly Asn He Cys He Ser Leu Tyr Ala Trp Phe Asp Tyr Ser 
662 671 680 689 698 707 

ATG CTG CAT CGC AAG CAC TGG GAG CAC CAC AAC CAT ACT GGC GAA GTG GGG AAA 
Met Leu His Arg Lys His Tro Glu His His Asn His Thr Gly Glu Val Gly Lys 
716 725 734 743 752 'bl 

GAC CCT GAC TTC CAC AAG GGA AAT CCC GGC CTT GTC CCC TGG TTC GCC AGC TTC 
Aso Pro Asp Phe His Lys Gly Asn Pro Gly Leu val Pro Trp Phe Ala Ser Phe 
770 779 788 797 806 815 

ATG TCC AGC TAC ATG TCC CTG TGG CAG TTT GCC CGG CTG GCA TGG TGG GCA GTG 
Met Ser Ser Tyr Met Ser Leu Trp Gin Phe Ala Arg Leu Ala Trp Trp Ala val 
824 833 842 851 860 8o9 

GTG ATG CAA ATG CTG GGG GCG CCC ATG GCA AAT CTC CTA GTC TTC ATG GCT GCA 
Val Met Gin Met Leu Gly Ala Pro Met Ala Asn Leu Leu Val Phe Met Ala Ala 
878 887 896 905 914 923 

GCC CCA ATC TTG TCA GCA TTC CGC CTC TTC TAC TTC GGC ACT TAC CTG CCA CAC 
Ala Pro He Leu Ser Ala Phe Arg Leu Phe Tyr Phe Gly Thr Tyr Leu Pro His 
932 941 950 959 968 977 

AAG CCT GAG CCA GGC CCT GCA GCA GGC TCT CAG GTG ATG GCC TGG TTC AGG GCC 
Lys Pro Glu Pro Gly Pro Ala Ala Gly Ser Gin Val Met Ala Trp Phe Arg Ala 
aoc qqq inn* mil 1022 1031 



98 b yy 5 iuui jluj-j r-ZZ 

AAG ACA AGT GAG GCA TCT GAT GTG ATG AGT TTC CTG ACA TGC TAC CAC TTT GkC 

Lys Thr Ser Glu Ala Ser Asd Val Met Ser Phe Leu Thr Cys Tyr Kis Phe Asp 

1040 1049 * 1058 1067 1076 1035 

CTG CAC TGG GAG CAC CAC AGG TGG CCC TTT GCC CCC TGG TGG CAG CTG CCC CAC 

Leu His Trp Glu His His Arg Trp Pro Phe Ala Pro Trp Trp Gin Leu Pro His 

1094 1103 1112 1121 H30 

TGC CGC CGC CTG TCC GGG CGT GGC CTG GTG CCT GCC TTG GCA TGA 
Cys Arg Arg Leu Ser Gly Arg Gly Leu Val Pro Ala Leu Ala 
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FIG. 2 
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368 


GCG 
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791 
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Ala 


Ala 






917 
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FIG. 3 



c 

| 272 281 290 299 308 317 

ATG CCA TCC GAG TCG TCA GAC GCA GCT CGT CCT GCG CTA AAG CAC GCC TAC AAA 
Met Pro Ser Glu Ser Ser Asp Ala Ala Arg Pro Ala Leu Lys His Ala Tyr Lys 
326 . 335 344 353 362 371 

CCT CCA GCA TCT GAC GCC AAG GGC ATC ACG ATG GCG CTG ACC ATC ATT GGC ACC 
Pro Pro Ala Ser Asp Ala Lys Gly lie Thr Met Ala Leu Thr lie lie Gly Thr 
380 389 398 407 416 425 

TGG ACC GCA GTG TTT TTA CAC GCA ATA TTT CAA ATC AGG CTA CCG ACA TCC ATG 
Trp Thr Ala Val Phe Leu His Ala lie Phe Gin He Arg Leu Pro Thr Ser Met 
434 443 452 461 470 479 

GAC CAG CTT CAC TGG TTG CCT GTG TCC GAA GCC ACA GCC CAG CTT TTG GGC GGA 
Asp Gin Leu His Trp Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu Gly Gly 
488 497 506 515 524 533 

AGC AGC AGC CTA CTG CAC ATC GCT GCA GTC TTC ATT GTA CTT GAG TTC CTG TAC 
Ser Ser Ser Leu Leu His He Ala Ala Val Phe He Val Leu Glu Phe Leu Tyr 
542 551 560 569 578 587 

ACT GGT CTA TTC ATC ACC ACA CAT GAC GCA ATG CAT GGC ACC ATA GCT TTG AGG 
Thr Gly Leu Phe He Thr Thr His Aso Ala Met His Gly Thr He Ala Leu Arg 
596 605 614 623 632 641 

CAC AGG CAG CTC AAT GAT CTC CTT GGC AAC ATC TGC ATA TCA CTG TAC GCC TGG 
His Arg Gin Leu Asn Asp Leu Leu Gly Asn He Cys He Ser Leu Tyr Ala Trp 
650 659 668 677 686 695 

TTT GAC TAC AGC ATG CTG CAT CGC AAG CAC TGG GAG CAC CAC AAC CAT ACT GGC 
Phe Aso Tyr Ser Met Leu His Arg Lys Kis Trp Glu His His Asn His Thr Gly 
704 713 722 731 740 749 

GAA GTG GGG AAA GAC CCT GAC TTC CAC AAG GGA AAT CCC GGC CTT GTC CCC TGG 
Glu val Gly Lys Asp Pro Asp Phe Kis Lys Gly Asn Pro Gly Leu val Pro Trp 
758 767 776 785 794 803 

TTC GCC AGC TTC ATG TCC AGC TAC ATG TCC CTG TGG CAG TTT GCC CGG CTG GCA 
Phe Ala Ser Phe Met Ser Ser Tyr Met Ser Leu Trp Gin Phe Ala Arg Leu Ala 
812 821 830 839 848 857 

TGG TGG GCA GTG GTG ATG CAA ATG CTG GGG GCG CCC ATG GCA AAT CTC CTA GTC 
Trp Tro Ala Val Val Met Gin Met Leu .Gly Ala Pro Met Ala Asn Leu Leu Val 
866 875 884 893 902 911 

TTC ATG GCT GCA GCC CCA ATC TTG TCA GCA TTC CGC CTC TTC TAC TTC GGC ACT 
Phe Met Ala Ala Ala Pro He Leu Ser Ala Phe Arg Leu Phe Tyr Phe Gly Thr 
920 929 938 947 956 965 

TAC CTG CCA CAC AAG CCT GAG CCA GGC CCT GCA GCA GGC TCT CAG GTG ATG GCC 
Tyr Leu Pro His Lys Pro Glu Pro Gly Pro Ala Ala Gly Ser Gin Val Met Ala 
974 983 992 1001 1010 1019 

TGG TTC AGG GCC AAG ACA AGT GAG GCA TCT GAT GTG ATG AGT TTC CTG ACA TGC 
Trp Phe Arg Ala Lys Thr Ser Glu Ala Ser Asp Val Met Ser Phe Leu Thr Cys 
1028 1037 1046 1055 1064 1073 

TAC CAC TTT GAC CTG CAC TGG GAG CAC CAC AGG TGG CCC TTT GCC CCC TGG TGG 
Tyr His Phe Asp Leu His Trp Glu His Kis Arg Tro Pro Phe Ala Pro Trp Tr? 

1082 1091 1100 1109 1H8 1127 

CAG CTG CCC CAC TGC CGC CGC CTG TCC GGG CGT GGC CTG GTG CCT GCC TTG GCA 
Gin Leu Pro His Cys Arg Arg Leu Ser Gly Arg Gly Leu Val Pro Ala Leu Ala 

TGA D 
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FIG. 4 



30 60 
CGGGGCAACT CAAGAAATTC AACAGCTGCA AGCGCGCCCC AGCCTCACAG CGCCAAGTGA 
GCCCCGTTGA GTTCTTTAAG TTGTCGACGT TCGCGCGGGG TCGGAGTGTC GCGGTTCACT 



90 

GCTATCGACG TGGTTGTGAG CGCTCGACGT GGTCCACTGA 
CGATAGCTGC ACCAACACTC GCGAGCTGCA CCAGGTGACT 



CTCCGTCCTC 
GAGGCAGGAG 
B 



caIi 



CGGCACTAAT 
GCCGTGATTA 



GAGCGTGGGC 
CTCGCACCCG 



150 

TGCCAAATCT CGCGTCGGGG CCTGCCTAAG 
ACGGTTTAGA GCGCAGCCCC GGACGGATTC 

210 

GGTCGAGCAG AAAGGCAGTG AGGCAGCTGC 
CCAGCTCGTC TTTCCGTCAC TCCGTCGACG 

c 

1270 
TGCCAT CCGAGTCGTC 
CTGTGTCATA GTGTACGGTA GGCTCAGCAG 



120 

CGGGCCTGTG AGCCTCTGCG 
GCCCGGACAC TCGGAGACGC 
A 

▼ 180 

TCGAAGAATG CACGTCGCAT 
AGCTTCTTAC GTGCAGCGTA 

240 

TTCCAGCCCA GACGTCTTGA 
AAGGTCGGGT CTGCAGAACT 

300 

AGACGCAGCT CGTCCTGCGC 
TCTGCGTCGA GCAGGACGCG 



330 360 
TAAAGCACGC CTACAAACCT CCAGCATCTG ACGCCAAGGG CATCACGATG GCGCTGACCA 
ATTTCGTGCG GATGTTTGGA GGTCGTAGAC TGCGGTTCCC GTAGTGCTAC CGCGACTGGT 

390 420 
TCATTGGCAC CTGGACCGCA GTGTTTTTAC ACGCAATATT TCAAATCAGG CTACCGACAT 
AGTAACCGTG GACCTGGCGT CACAAAAATG TGCGTTATAA AGTTTAGTCC GATGGCTGTA 

450 480 
CCATGGACCA GCTTCACTGG TTGCCTGTGT CCGAAGCCAC AGCCCAGCTT TTGGGCGGAA 
GGTACCTGGT CGAAGTGACC AACG GAC AC A GGCTTCGGTG TCGGGTCGAA AACCCGCCTT 

510 540 
GCAGCAGCCT ACTGCACATC GCTGCAGTCT TCATTGTACT TGAGTTCCTG TACACTGGTC 
CGTCGTCGGA TGACGTGTAG CGACGTCAGA AGTAACATGA ACTCAAGGAC ATGTGACCAG 

570 600 
TATTCATCAC C AC AC AT G AC GCAATGCATG GCACCATAGC TTTGAGGCAC AGGCAGCTCA 
ATAAGTAGTG GTGTGTACTG CGTTACGTAC CGTGGTATCG AAACTCCGTG TCCGTCGAGT 

630 660 
ATGATCTCCT TGGCAACATC TGCATATCAC TGTACGCCTG GTTTGACTAC AGCATGCTGC 
TACTAGAGGA ACCGTTGTAG ACGTATAGTG ACATGCGGAC CAAACTGATG TCGTACGACG 

690 720 
ATCGCAAGCA CTGGGAGCAC CACAACCATA CTGGCGAAGT GGGGAAAGAC CCTGACTTCC 
TAGCGTTCGT GACCCTCGTG GTGTTGGTAT GACCGCTTCA CCCCTTTCTG GG AC TGAAGG 

750 780 
ACAAGGGAAA TCCCGGCCTT GTCCCCTGGT TCGCCAGCTT CATGTCCAGC TACATGTCCC 
TGTTCCCTTT AGGGCCGGAA CAGGGGACCA AGCGGTCGAA GTACAGGTCG ATGTACAGGG 

810 840 
TGTGGCAGTT TGCCCGGCTG GCATGGTGGG CAGTGGTGAT GCAAATGCTG GGGGCGCCCA 
ACACCGTCAA ACGGGCCGAC CGTACCACCC GTCACCACTA CGTTTACGAC CCCCGCGGGT 
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FIG. 5 



870 900 
TGGCAAATCT CCTAGTCTTC ATGGCTGCAG CCCCAATCTT GTCAGCATTC CGCCTCTTCT 
ACCGTTTAGA GGATCAGAAG TACCGACGTC GGGGTTAGAA CAGTCGTAAG GCGGAGAAGA 

930 960 
ACTTCGGCAC TTACCTGCCA CACAAGCCTG AGCCAGGCCC TGCAGCAGGC TCTCAGGTGA 
TGAAGCCGTG AATGGACGGT GTGTTCGGAC TCGGTCCGGG ACGTCGTCCG AGAGTCCACT 

990 1020 
TGGCCTGGTT CAGGGCCAAG ACAAGTGAGG CATCTGATGT GATGAGTTTC CTGACATGCT 
ACCGGACCAA GTCCCGGTTC TGTTCACTCC GT AG ACT AC A CTACTCAAAG GACTGTACGA 

1050 1080 
ACCACTTTGA CCTGCACTGG GAGCACCACA GGTGGCCCTT TGCCCCCTGG TGGCAGCTGC 
TGGTGAAACT GGACGTGACC CTCGTGGTGT CCACCGGGAA ACGGGGGACC ACCGTCGACG 

1110 H40 
CCCACTGCCG CCGCCTGTCC GGGCGTGGCC TGGTGCCTGC CTTGGCATGA CCTGGTCCCT 
GGGTGACGGC GGCGGACAGG CCCGCACCGG ACCACGGACG GAACCG^ACT GGACCAGGGA 

1170 D 1200 

CCGCTGGTGA CCCAGCGTCT GCACAAGAGT GTCATGCTAC AGGGTGCTGC GGCCAGTGGC 
GGCGACCACT GGGTCGCAGA CGTGTTCTCA CAGTACGATG TCCCACGACG CCGGTCACCG 

1230 I 260 
AGCGCAGTGC ACTCTCAGCC TGTATGGGGC TACCGCTGTG CCACTGAGCA CTGGGCATGC 
TCGCGTCACG TGAGAGTCGG ACATACCCCG ATGGCGACAC GGTGACTCGT GACCCGTACG 

1290 1320 
CACTGAGCAC TGGGCGTGCT ACTGAGCAAT GGGCGTGCTA CTGAGCAATG GGCGTGCTAC 
GTGACTCGTG ACCCGCACGA TGACTCGTTA CCCGCACGAT GACTCGTTAC CCGCACGATG 

1350 1380 
TGACAATGGG CGTGCTACTG GGGTCTGGCA GTGGCTAGGA TGGAGTTTGA TGCATTCAGT 
ACTGTTACCC GCACGATGAC CCCAGACCGT CACCGATCCT ACCTCAAACT ACGTAAGTCA 

1410 I 440 
AGCGGTGGCC AACGTCATGT GGATGGTGGA AGTGCTGAGG GGTTTAGGCA GCCGGCATTT 
TCGCCACCGG TTGCAGTACA CCTACCACCT TCACGACTCC CCAAATCCGT CGGCCGTAAA 

1470 I 500 
GAGAGGGCTA AGTTATAAAT CGCATGCTGC TCATGCGCAC ATATCTGCAC ACAGCCAGGG 
CTCTCCCGAT TCAATATTTA GCGTACGACG AGTACGCGTG TATAGACGTG TGTCGGTCCC 

1530 1560 
AAATCCCTTC GAGAGTGATT ATGGGACACT TGTATTGGTT TCGTGCTATT GTTTTATTCA 
TTTAGGGAAG CTCTCACTAA TACCCTGTGA ACATAACCAA AGCACGATAA CAAAATAAGT 

1590 1*20 
GCAGCAGTAC TTAGTGAGGG TGAGAGCAGG GTGGTGAGAG TGGAGTGAGT GAGTATGAAC 
CGTCGTCATG AATCACTCCC ACTCTCGTCC CACCACTCTC ACCTCACTCA CTCATACTTG 

1650 1677 
CTGGTCAGCG AGGTGAACAG CCTGTAATGA ATGACTCTGT CTAAAAAAAA AAAAAAA 
GACCAGTCGC TCCACTTGTC GGACATTACT TACT GAG AC A GATTTTTTTT TTTTTTT 
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FIG. 6 
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FIG. 7 
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FIG. 8 
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FIG. 9 
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FIG. 11 
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